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GALECTIN EXPRESSION IS INDUCED IN CIRRHOTIC UVER AND HEPATOCELLULAR CARCINOMA 

FIELD OF THE INVENTION 

The present invention relates to the discovery of a marker for fiver disease. Novel diagnostics, prognostics* 
therapeutics and methods of use of .the foregoing for the treatment and prevention of hepatocellular carcinoma are 
5 also disclosed. 

BACKGROUND OF THE INVENTION 
Hepatocellular carcinoma is a major type of cancer causing a quarter of a million deaths worldwide each year 
(Kew, M.C., Tumors of the fiver. In: 0. Zakrm and T.D. Boyer (eds.), Hepatobgy. A Textbook of Liver Disease, pp. 
1206-1240, W.B. Saunders, Philadelphia (1990)). While various factors have been identified as causes for HCC, the 

10 major established factors are infections of hepatitis viruses B (HBVI and C (HCV). Various potential mechanisms exist 
whereby infection by HBV can result in HCC. These include both chronic liver injury caused by cytotoxic T ceil 
responses to infected hepatocytes and intracellular occlusion resulting from expression of viral protein (Chisari, F.V., 
Analysis of hepadnavirus gene expression, biology, and pathogenesis in the transgenic mouse. In: Current Topics in 
Microbiology and Immunology, pp. 85-99, Springer-Verlag, Berlin (1991)). 

15 Deregulated expression of a number of proteins such as tumor suppressor genes and altered cellular activity 

of other cellular genes have been associated with HCC. (Robinson, W.S., JMastroenterol.Hapatol. 8:95 (1993)). 
Transactivation fay HBV-X protein is also considered to significantly contribute to cell transformation. This protein has 
been shown to transactivate several viral and cellular genes such as tjun, c-myc. /^interferon and class I 
histocompatibility complex. (Robinson, W.S., JMastroenteroLHepatol, 8:95 (1993)). The transcription factors CREB 

20 and ATF2 and various cis-acting ONA elements that regulate cellular genes, such as kB, API and AP2, are also 
affected by HBV-X. (Robinson, W.S.. J.GastrventeroUiepatol, 8:95 (1993)). Although significant strides have been 
made toward understanding the onset of HCC, the need for sensitive diagnostics, prognostics, and therapeutics for 
the treatment and prevention of HCC, is manifest. 

BRIEF SUMMARY OF THE INVENTION 

25 Aspects of the present invention relate to the discovery that hepatocytes in liver biopsies from subjects with 

hepatocellular carcinoma (HCC) have significant levels of galectin-3 (76% irnmunohbtocterrncally positive) white 
galectin-3 expression is low or absent in normal hepatocytes. Notably, the correlation of galectin-3 expression and 
HCC is independent of whether the subject had prior hepatitis B virus infection (14 of 18 HCC cases from HBV* 
patients, and 5 of 7 cases from HBV- patients demonstrated positive galectin-3 immunohistochemistry). This 

30 indicates that galectin-3 expression may be used to detect HCC resulting from a variety of factors. However, co- 
transfection studies, using a galectin-3 promoter construct and an HBV-X protein (HBV-X) expression vector 
demonstrated that galectin-3 expression can occur through transactivation of the galectin-3 promoter by HBV-X. 
Additionally, aspects of the present invention relate to the discovery that galectin-3 is abundantly expressed in 
cirrhotic liver in a peripheral distribution within regenerating nodules. The galectin-3 expression in rapidly proliferating 

35 hepatocytes in cirrhotic liver can be used to identify an early neoplastic event. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
FIGURE 1 shows the results of two chloramphenicol acetyl transferase assays in which the transactivation 
of the galectin-3 promoter by cotransf ection with an HBV-X protein expression vector is demonstrated. 
FIGURE 2 shows the cONA sequence of human galectin-3. 
5 DETAILED DESCRIPTION OF THE INVENTION 

In the present invention, several embodiments relate to the discovery that hepatocytes in liver biopsies from 
subjects with hepatocellular carcinoma (HCC) have significant levels of galectin-3 (76% immunohistochemicaily 
positive); whereas, galectin-3 expression is low or absent in normal hepatocytes. In addition, galectin-3 expression in 
HCC is independent of whether the subject had prior hepatitis 6 virus infection (14 of 18 HCC cases from HBV+ 
1 0 patients, and 5 of 7 cases from K8V- patients demonstrated positive galectin-3 rmmunohistochemistry). Furthermore, 
cotransfection of a galectin-3 promoter construct and an HBV-X protein (HBV-X) expression vector induces galectin-3 
expression, through transactivation of the lectin promoter by HBV-X. Additionally, galectin-3 expression in 
regenerating nodules is also associated with cirrhosis of the fiver. Herein we provide novel diagnostics and 
prognostics for determining whether a subject is suffering from HCC or cirrhosis of the liver or is likely to develop HCC 
15 or cirrhosis of the liver. Therapeutic agents or vaccine components for the treatment and/or prevention of HCC or fiver 
cirrhosis are also provided. 

Galectins are a family of proteins characterized by sequence homologies in a domain which exhibits 
galactose-specific carbohydrate binding activity. There are presently 12 characterized eukaryotic members, and 
currently a total of two dozen members have been predicted by homology from ONA sequences present in databases. 

20 Several dozen human members can be anticipated from projections based on the number of similar members present in 
the completely sequenced Caenorhabtfitis eiegans genome. Galectins do not contain traditional sequences that specify 
membrane translocation, but are both secreted and located intraceliularry. Of the characterized galectins, galectins- 1 
and 3 are the most extensively studied (Barondes, et al., JJMCtonx, 269:20807 (1994); Kasai, K. and Hirabayashi, 
J..J.Bh>diem.(Tokyo). 119:1 (1996)). 

25 Galectin-3, previously designated as eBP, CBP35, Mac-2, L-29 and 1-34, has been associated with assorted 

processes such as cell growth, tumor transformation, and metastasis. (Moutsatsos, et aL, ProcNatlMad.ScLUSA, 
84:6452 (1987), (Raz, et al.. Cancer Res., 46:3667 (1986). (Raz, et aL, InUCaitcer. 46:871 (1990), and (Hsu, et aL, ^ 
AmJ.Pathol., 148:1661 (19961). This lectin of approximately 30,000 Oa is composed of two domains: a carboxyf- 
terminal domain that contains the carbohydrate-binding region which binds saccarides with terminal galactose 

30 residues and an amino-terminal domain consisting primarily of tandem repeats of nine amino acids (Liu, et aL, 
Biochemistry, 35:6073 (1996)). Galectin-3 is expressed in various tissues and organs and is present in dendritic cells 
and a variety of norma) cells of epithelial origin but is undetectable in normal lymphocytes and hepatic cells. 

One aspect of the present invention relates to methods of determining whether an individual is suffering 
from HCC or cirrhosis of the fiver, or is likely to suffer from HCC or cirrhosis of the liver, comprising determining 

35 whether the level of galectin-3 expression in the subject's hepatocytes is above normal. Example 1 describes the 
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immunohhtochemical characterization of galectin-3 expression in samples from normal liver, subjects with cirrhosis of 
the Over, and subjects with HCC. Pathological specimens were obtained from archives at the Medical Center of the 
University of California, San Diego, or the Veterans General Hospital Taichung, Taiwan. Cases studied included 
normal liver (six cases), cirrhotic liver (eight cases), hepatocellular carcinoma (25 cases), fibrolamellar tumor (two 
cases) and hepatoblastoma (one case). 

EXAMPLE 1 

The expression of gatectin-3 in normal human liver and cirrhotic human liver biopsy tissue was analyzed by 
immunohistochemistry. These normal liver biopsies were obtained from normal subjects or regions in HCC patients 
unaffected by tumor. Tissues were fixed m either paraformaldehyde (for samples probed with rabbit antibody) or BS 
(for samples probed with monoclonal antibody) and embedded in paraffin. Tissue sections were processed for 
immunohistochemical detection of gaIectin-3, as described previously (Konstantinov, et al„ AmJ.PathoL, 148:25 
(1996)). In brief, primary antibodies used were affinity-purified rabbit anti-galectin-3 antibody or mouse monoclonal 
anti galecttn antibody A3A12 (Liu, et al.. Biochemistry, 35:6073 (1996)), both at 10 //g/mL Secondary antibodies 
were goat antkabbit peroxidase and goat anti-mouse peroxidase conjugates, respectively. Antibody controls used 
were normal rabbit immunoglobulin and isotype matched irrelevant monoclonal antibodies, and did not result in any 
staining. Chromogenic substrates were 3,3'-diaminobenzidine or 3-amino-9-ethylcarbazolB. 

A total of four normal samples were probed with affinity purified rabbit polyclonal anti-galectin-3. Positive 
reactivities were demonstrated by brown precipitates (peroxidase-diaminobenztdine) and counterstaining was done 
with hematoxylin. Prominent staining was noted in epithelial ceils lining the bite ducts and ceils with the appearance 
and distribution of Kuppfer Cells. In contrast, hepatocytes did not show any significant level of staining. Nor did 
connective tissue show any staining. The negative controls for the experiment used non-immune rabbit gamma 
globulin instead of the specific primary antibodies and consistently showed only background staining. Two normal 
biopsies were also reacted with monoclonal antibodies and similar reactivities to galectin-3 as that obtained with the 
polyclonal antibody were obtained. The observed pattern of galectin-3 expression m normal human liver is consistent 
with that reported for mouse Gver. (Flotte, et *\. t AmJJ>athol., 111:1 12 (1983)). 

The most prominent feature observed m the eight biopsies from subjects having cirrhosis of the liver was the 
strong staining of focal regenerating nodules. Variable degrees of staining intensity were seen in different foci of 
regenerating liver. Preferential circumferential expression of galectin-3 in the foci was also observed, suggesting 
differential regulation. Islands of positively stained cells, and general distribution of other positively staining cells were 
also observed throughout nodules. In all samples of cirrhotic liver, both cytoplasmic and nuclear expression of 
galectin-3 were observed. Other regions of hepatic tissue do not display this staining pattern. The ductal epithelial 
cells, dendritic cells, and Kuppfer cells were also positively stained in the tissue specimens from cirrhotic Over. In the 
discussion that follows, we describe an experiment in which we discovered that galectin-3 is expressed in human 
hepatocellular carcinoma. 
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The pattern of galectin-3 expression in human hepatocellular carcinoma was visualized in much the same 
manner as described above for normal and cirrhotic liver. The pattern of galectin-3 expression was determined by 
using an affinity purified rabbit anti-galectin-3 antibody as the primary antibody and positive reactivities were 
demonstrated by reddish-brown precipitates (peroxidase-aminoethylcarbazole). In addition, a mouse monoclonal anti- 
5 galectin-3 antibody was used and positive reactivities were observed by brown precipitates (peroxidase- 
diaminobenzidinel. In both sets of experiments, counterstaming was done with hematoxylin. 

The immunohistochemical staining of tumor cells revealed varying degrees of intensity and distribution. 
Based on the intensity and distribution of positive staining regions, scores from 1 to 4 were assigned: One* 
(scattered positive tumor cells); 2+ (diffuse positive or locally positive); 3+ (large areas of positives, diffuse and 

10 focal); and 4+ (mostly positive and strongly positive). In most of the HCC tissues (76%) positive galectin-3 
immunoreactivities of scoring intensity 1 and greater were observed in the neoplastic cells. (See Table I). In each 
HCC tissue sample, localization of galectin-3 was predominantly cytoplasmic but nuclear staining was invariably noted 
in many cells, and was observed in all samples positive for galectin-3 expression. The neoplastic cells were not 
uniformly positive and tissues from different cases were variable in terms of the fraction of the tumor cells that were 

15 positively stained. In some specimens, however, the neoplastic ceils were not stained despite positive staining of the 
bile ducts and Kuppfer cells. Specificity of the rabbit anti-galectin-3 antibody was verified by immunohistochemistry 
using a mouse monoclonal antibody. Two HCC biopsies were stained with both polyclonal and monoclonal antibodies 
and similar staining patterns in tissues expressing the lectin were observed. For most cases of hepatocellular 
carcinoma, HBV serology of the patients was available, and are included in Table I. Positive HBV infection status is 

20 defined by presence of serum HBV surface antigen or antibodies to HBV surface antigen. Positive and negative sera 
status are indicated as V and respectively, and no signage when no information was available (Table I). Of the 
25 HCC biopsies studied, 18 were from documented HBV seropositive patients. Seventy-eight percent of the 18 
biopsies showed positive immunohistochemical reactivities for galectin-3. Seven cases of HCC occurred in HBV sero 
negative patients, five of which were positively stained for galectin-3. Thus, galectin-3 expression in hepatocytes is a 

25 general marker for HCC, and is not limited to HCC resulting from HBV infection. The HCC samples studied for 
galectin-3 expression and the serological status for HBV antigens are summarized in Table I. 
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Table I 



Galectin-3 Irrununohistochemical Reactivities in HCC Tissue 
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35 

Samples classified as HBV positive as shown by serology are indicated in bold type. 

1 Fibrolamellar tumor 

2 Hepatoblastoma 
Summary of HCC Statistics 

40 GalectM Positive Galectin-3 foqtfve 

HBV* 14 4 

HBVWundetermined 5 2 

Abbreviations: HBsAg, HBV surface antigen; HBeAg, HBV envelope antigen; a-HBs, antibodies to HBV surface antigen; a- 
HBe, antibodies to HBV envelope antigen; a-HBec, antibodies to HBV core antigen. 
45 Another aspect of the present invention relates to methods of determining the pattern of galectin-3 

eipression in hepatocellular carcinoma cell Gnes, comprising identifying whether the level of galectin-3 expression in 
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tha cells is above normal. Example 2 describes the characterization of galectin-3 expression in several hepatocellular 
carcinoma cell lines. 

EXAMPLE 2 

The association of galectin-3 expression with HCC was also confirmed in established HCC cell lines by 
5 immunoblot. As a control organ extracts from mouse liver, as well as. thymus and spleen were analyzed. The human 

hepatoma ceil Tines Hep 3B (HBsAg + ), Hep G2 (Aden, et aL, Nature. 282:615 (1979}). HuH-7 (Nakabayashi. et a!.. 

Cancer Res., 42:3858 (1982)) and PLC/PRF/5 (HBsAg+l (Alexander, et aL. SMUMed.Sc/.. 41:89 (1976)) were 
kindly provided by Or. F. Chisari. The Scripps Research Institute. La Jolla. California. The Ml 2.4.5 line was a 
generous gift of Dr. Richard Asofsky. National Institutes of Health. Bethesda. Maryland. Cells were cultured in 

1 0 Dulbecco's Modified Eagle's Medium supplemented with 1 0 % fetal bovine serum and 2 mM glutamine. 

Galectin-3 expression in mouse tissues and hepatoma cell lines was detected by immunoblot tmg, as 
described previously (Liu. et aL. AmJ.Pathol. % 147:1016 (1995)). Briefly, ceil lysates were adsorbed with lactosyl- 
Sepharose 4B and the bound proteins were eluted and electrophoresed in 12.5 % potyacrylamide gel. The separated 
proteins were transferred to fmmobilon P membranes and galectin-3 was detected by a chemHuminescent system 

15 using a rabbit antt-galectm-3 antibody and alkaline phosphatase-conjugated goat anti-rabbit antibody. Each lane 
represents the galectin-3 present in 100 //g protein from tissue extracts and 500 //g protein from each cell lysate. 
Protein markers are shown on the left in kDa. 

The control immunoblot showed that galectin-3 was expressed in spleen and thymus extracts but not fiver. 
The immunoblot of the HCC cell lines showed that galectin-3 was expressed in all four HCC ceil fines tested. The 

20 identity of the protein was also verified by its lectin activity, as demonstrated by binding to lactosyl-Sepharose 48. co- 
migration with the authentic galectin-3 in electrophoresis, and reactivity with specific rabbit antibodies in immunoblot. 
Two HCC ceO lines used. Hep 3B and PLC/PRF/5 are known to contain integrated HBV sequences (Aden, et at. 
Nature. 282:615 (1979); MacNab. et aL. British J. Cancer, 34:509 (1976)). However, there is no evidence that the 
other cell lines. Hep G2 and HuH-7 are infected with HBV (Aden, et aL. Nature. 282:615 (1979); Nakabayashi. et aL. 

25 Cancer Res.. 42:3858 (1982)). Galectin-3 expression appeared to be generally associated with HCC but is not limited 
to HCC caused by HBV. 

While galectin-3 expression is not limited to HCC resulting from HBV. HBV-X protein induces galectin-3 
expression, as described in the example below. 

EXAMPLE 3 

30 The expression of galectin-3 in HBV seropositive individuals with HCC prompted us to test whether the 

galectin-3 promoter can be activated by HBV transacting proteins. (See Figure 1). Accordingly, the vector 
pGalec3-CAT, which contains the murine galectin-3 promoter coupled to a chloramphenicol acetyl transferase (CAT) 
reporter gene, was used to co-transfect Ml 2.4.5 cells with a vector expressing the HBV transactivating protein HBV- 
X. Construction of the vector pGatec3-CAT consisting of the murine galectin-3 promoter upstream of a bacterial 

35 chloramphenicol acetyl transferase (CAT) gene was accomplished, as described by Hsu. et aL. AmJJPathoL. 
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148:1661 (1996). Briefly, a 2 kb region upstream from the transcription initiation site was inserted into a vector 
housing a CAT gene. The HBV-X expression vector pARVIMT and parent pMT (McLachJan, et al., J.VtroL. 61:683 
(1987)1 used for cotransfections, were generous gifts of Or. Alan McLachlan, The Scripps Research institute, and 
contained the murine metallothionein-l gens promoter sensitive to and inducible with metal ions. Co-transfections of 
5 pGalec3-CAT and pARVIMT or pMT into murine M12.4.5 cells were accomplished by electroporation, as described by 
Hsu. et at AmJ.PatM t 148:1661 (1996). Induction of the metallothionein promoter was initiated in some cultures 
twenty four hr after electroporation by addition of ZnCI 2 to 10 //M. CAT activities of transfectants were assayed 48 

M 

hr after transfection using thB phase transfer method with ldichloroacetyH,2- ^chloramphenicol (New England 
Nuclear. Boston, MA) and n-butyl coenzyme A (Sigma, St. Louis. MO). Corrections were made for basal activity by 

1 0 subtracting the activity of the parental pCAT-basic vector from all other measurements. 

The CAT activity shown in Figure 1 is a measure of the activity of the galectin-3 promoter in the pGalec3- 
CAT vector in Ml 2.4.5 host ceils. Cotransfections were performed with the HBV-X expression vector pARVIMT or 
control pMT. Cotransfections performed with 10 j/g or 20 /sg pARVIMT in the presence of zinc are indicated by 
pARVIMTMO and pARV1MT/20, respectively, and pARV1MT/20(Zn) when performed with 20 vector in the 

15 absence of zinc. Significant upregutation of the galectin-3 promoter is observed when cotransfected with pARVIMT, 
but not with pMT. AH values were corrected for basal CAT activity reflective of transfection with the promoter-less 
CAT expression vector, and are expressed relative to pGalec3-CAT alone (unity). Oata are means of duplicate 
transf ections shown with error bars indicating SO and are representative of 2 experiments. 

As shown in Figure 1, co-transfection of the vector pGa!ec3-CAT and the vector expressing HBV-X protein 

20 resulted in upregulation of the galectin-3 promoter as evident from increased CAT activity. The vector expressing 
HBV-X protein contains a metallothionein gene promoter and is sensitive to and inducible by metal tons. In the 

presence of Zn* + , however, suppression of CAT activity was observed, indicating that activation of the murine 
metallothionein-l promoter driving expression of the HBV-X protein had already been achieved. 

The results presented above show that many normal hepatocytes. which do not express galectin-3, are 

25 converted to galectin-3 expressing cells upon neoplastic transformation. Of the HCC biopsies evaluated, 19 of 25 
(76%) were immunohistochemically positive for galectin-3. While not all neoplastic cells m HCC express this lectin 
and the cases of HCC studied fall into a spectrum in terms of the proportion of tumor ceils that are positively stained 
for galectin-3; galectin-3 expression is associated with HCC and cirrhosis of the liver. The presence of galectin-3 in 
HCC cell lines indicates that upregulation of this lectin occurs in transformed hepatocytes. 

30 Analysis of galectin-3 expression in HCC from patients with or without serological evidence of HBV infection 

revealed that galectin-3 expression in HCC is not strictly correlated with HBV infection (positive galectin-3 expression 
in 78% of HBV ♦ patients and 71% of HBV- patients). In addition, since at least one of the cell lines shown to express 
galectin-3 does not appear to exhibit evidence of HBV infection, the data suggest that factors other than HBV 
infection can result in galectin-3 expression. Additional evidence indicating non-viral upregulation of galectin-3 

35 expression in hepatic tissue exists in the observation that fibrolamellar tumors also express galectin-3. These tumors 
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generally occur in young patients, are not associated with chronic infection with HBV, and do not usually appear in 
cirrhotic liver (Berman. et aL, Cancer. 46:1448 (1980); Craig, et aL. Cancer, 46:372 (1980)). 

The following discussion is offered only to provide possible explanations for the relationship of expression of 
galectin-3 to neoplasia. These explanations are not intended to limit any aspect of the present invention nor are they 
5 intended to be used to construe, interpret, or restrict the scope of protection provided by the claims and/or the scope 
of equivalent compositions and methods. 

Results from our cotransfection experiments show that the upregulation of galectin-3 in HBV infected and 
transformed ceils can result from transactivation of the galectin-3 promoter by the HBV-X protein. No direct 
sequence-specific interaction of this protein occurs with DNA (Wu, et aL, Cell, 63:687 (1990)). The effects of HBV-X, 

10 however, can be mediated by interactions with transcription factors CREB and ATF2 (Maguire, et aL, Science. 
252:842 (1991)). In addition, HBV-X can transactrvate ONA elements regulating viral and cellular genes such as the 
kB sequence bound by the transcription factor NF-tcB (Twu, et aL. Proc.NatiAcad.SciMSA, 86:5168 (1989), the c- 
myc gene promoter (Koike, et aL, MoiecMfolJtied., 6:151 (1989) and binding sites for API and AP2 transcription 
factors (Seto, et aL, Nature. 344:72 (1990)). Indeed, several potential ONA sites for interaction with NF-kB, AP1 and 

15 AP2 were identified within 1 kb upstream of the transcription initiation site. The transactivation of the galectin-3 
gene by HBV-X is, therefore, consistent with our data demonstrating upregulation of this promoter by HTLV-i Tax 
(Hsu, et aL, Am J. Pathol. t 148:1661 (1996), a potent viral transactivator through the CREB/ATF transcriptional factor 
pathway. Furthermore, upregulated expression of galectin-3 was also demonstrated in 3T3 cells following 
transformation with the Kirsten murine sarcoma virus (Crittenden, et aL, MoLCeBBioL. 4:1252 (1984)). Additionally, 

20 analysis of fibroblasts from mouse and chicken for gaiectin-3 expression showed that infection with viruses can result 
in upregulated expression of the lectin (Crittenden, et aL, MoLCell 8ioL, 4:1252 11984)). In sum, we believe that 
galectin-3 expression may be affected by many viruses. 

The finding that regenerating nodules in cirrhotic liver express galectin-3 is also worthy of comment. Since 
these hepatocytes are undergoing rapid proliferation, we believe that this proliferative cell status is linked to galectin- 

25 3 expression, Previously, others reported that elevated levels of this lectin are found in proliferating fibroblasts 
(Moutsatsos, et aL, ProcJlfetlMadSci. USA. 84:6452 (1987)). An alternative but not mutually exclusive explanation 
may be the following. Because oval ceils expressing markers for both bile duct epithelia and hepatocytes (Gerber. et 
aL, AmJJPathol., 110:70 (1983)) are hypothesized to function as hepatic stem cells (Fausto, N. and Thompson, HJL, 
Purification and culture of oval cells from rat liver. In: Pretlow II, T.G. and T.P. Pretlow (edsj. Ceil Separation: 

30 Methods and Selected Applications, pp. 45-96, Academic Press, Orlando (1986)) and since galectin-3 is expressed in 
cells of ductal origin, presence of this protein in these nodules may be residual lectin resulting from the production of 
hepatocytes from these stem cells. In either event, we believe that regulation of galectin-3 expression during 
differentiation of hepatocytes relates to the fate of the cells, and deregulation, as a result of viral influences, 
interferes with this process. Further, we contemplate that ceils positively stained for galectin-3 are in the process of 
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becoming cancer cells and that expression of galectin-3 represents an early event leading to tumor transformation. 
Alternatively, expression of galectin-3 in cirrhotic cells may indicate conversion to normalcy. 

In the following discussion, we provide several diagnostic and prognostic tools for use with embodiments of 
the present invention. 

5 

Diagnostic tools 

While normal hepatocytes do not express galectin-3, expression of this lectin is associated with human 
hepatocellular carcinoma (HCC). Lectin expression in HCC is positively influenced by hepatitus B virus (HBV) infection 
through a mechanism that includes transactivation of the galectin-3 genB promoter. Other areas in the Over found to 

10 express galectin-3 are focal regenerating nodules of cirrhotic tissue. We contemplate that deregulated expression of 
galectin-3 results in transformation, conversion of tumors to increased invasiveness, and confers preferential survival 
of tumors. Accordingly, several diagnostic and prognostic tools which detect expression of galectins in the fiver may 
be used to determine whether an individual is suffering from HCC or cirrhosis of the liver or is likely to suffer from 
HCC or cirrhosis of the liver in the future. 

15 Generally, the diagnostics and methods of use thereof can be classified according to whether the diagnostic 

detects the expression of galectin-3 protein in a biological sample (e.g., a sample having hepatocytes) or the expression 
of galectin-3 HNAina biological sample. Accordingly, the level of expression of galectin-3 protein or RNA in a biological 
sample indicates a predilection to liver disease le.g., cirrhosis or HCC). Once a biological sample from a subject in need 
of testing is obtained, many different techniques can be used to detect the level of galectin-3 protein and/or RNA 

20 expression including, but not limited to, antibody-based detection techniques, bacteriophage display techniques, lectin- 
binding techniques, hybridization techniques, and enzymatic digestion (e.g.. RNAse protection) techniques. 
Additionally, we contemplate the use of physical detection methods including, but not limited to, absorption, emission, 
or resonance spectra, and the identification of galectins by nucleic acid or protein fragmentation patterns (e,g., 
enzymatic or chemical cleavage patterns). Some of these techniques involve disposing the proteins and/or nucleic 

25 acids present in the biological sample on a support, and contacting the support with detection components such as 
antibodies to galectin-3 or nucleic acid probes complementary to galectin-3 mRNA. Desirably, the levels of expression 
of galectin-3 protein and/or RNA from diseased and normal individuals is compared to the level detected in the subject 
tested. 

Additionally, we contemplate the preparation of diagnostic kits comprising detection components such as 
30 antibodies specific for the galectin -3 protein or nucleic acid probes for detecting RNA encoding galectin-3. The 
detection component will typically be supplied in combination with one or more of the following reagents. A support 
capable of absorbing or otherwise binding RNA or protein will often be supplied. Available supports for this purpose 
include, but are not limited to, membranes of nitrocellulose, nylon or derivatized nylon that may be characterized by 
bearing an array of positively charged substituents. One or more enzymes, such as Reverse Transcriptase and/or Taq 
35 polymerase, may be furnished in the kit, as may dNTPs. buffers, or non-human polynucleotides like caH-thymus or 
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salmon-sperm ONA. Results from the kit assays can be interpreted by a healthcare provider or a diagnostic 
laboratory. Alternatively, diagnostic kits are manufactured and sold to private individuals for self-diagnosis. 

In many aspects of the present invention, galectins or RNA encoding galectins obtained from a biological 
sample are disposed on a support so that the level of galectin expression can be rapidly determined. The term 
5 "matrix" or "support" refers to a carrier, a resin or any macromolecular structure used to attach, immobilize, or 
dispose thereon a biomolecule such as RNA or protein. Solid supports include, but are not limited to, the walls of 
wells of a reaction tray, test tubes, polystyrene beads, magnetic beads, nitrocellulose strips, membranes, 
microparticies such as latex particles, sheep (or other animal) red blood cells, duracytes® and others. Additionally, 
organic carriers including proteins and oligo/polysaccarides <e.g. cellulose, starch, glycogen, chitosane or aminated 

10 sepharose) and inorganic carriers such as silicon oxide material (e.g. silica gel, zeolite, diatomaceous earth or aminated 
glass) are contemplated. Furthermore, in some embodiments, a liposome or lipid bilayer (natural or synthetic) is 
contemplated as a support. Desirable supports also include polyecrylamide gels, agarose gels, composite gels, and 
other gel matrices, papers, chips, membranes, chromatography matrices, as used in thin layer chromatography, and 
resins or beads, as used in affinity chromatography. 

15 The support may have a hydrophobic surface which interacts with a portion of the biomolecule by 

hydrophobic non-covalent interaction. For example, the hydrophobic surface of the support may be a polymer such as 
plastic or any other polymer in which hydrophobic groups have been finked such as polystyrene, polyethylene or 
polyvinyl. The support may also have a charged surface which interacts with the biomolecule such as, a charged 
nitrocellulose or nylon membrane. The supports may have other reactive groups which can be chemically activated so 

20 as to attach a biomolecule. For example, cyanogen bromide activated matrices, epoxy activated matrices, thio and 
thiopropyi gels, nitrophenyl chloroformate and Nhydroxy succinimrde chlorformate linkages, and oxirane acrylic 
supports are common in the art. (Sigma) 

Any biomolecule which interacts with a galectin or a nucleic acid sequence complementary to a nucleic acid 
encoding a galectin (e.g., antibody, phage, P-galactoside sugar, other galectin ligand, or DNA or RNA), may be 

25 attached in overlapping areas or at random locations on the solid support and can be used to probe for the presence of 
a galectin or RNA encoding a galectin in a biological sample. Additionally, biomotecules which interact with galectins 
or nucleic acid sequences encoding galectins may be attached in an ordered array wherein each polynucleotide is 
attached to a distinct region of the solid support which does not overlap with the attachment site of any other 
biomolecule. Preferably, such an ordered array of biomlecules is designed to be "addressable" where the distinct 

30 locations are recorded and can be accessed as part of an assay procedure. Addressable biomolecule arrays typically 
comprise a plurality of different biomolecule probes that are coupled to a surface of a substrate in different known 
locations. The knowledge of the precise location of each biomolecule location makes these "addressable" arrays 
particularly useful in binding assays. 

Any addressable array technology known in the art can be employed with this aspect of the invention. One 

35 particular embodiment of polynucleotide arrays is known as the Genechips™, and has been generally described in US 
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Patent 5,143,854; PCT publications WO 90/15070 and 92/10091 These arrays may generally be produced using 
mechanical synthesis methods or light directed synthesis methods, which incorporate a combination of 
photolithographic methods and solid phase oligonucleotide synthesis (Fodor et al.. Science, 251:767777, 1991). The 
immobilization of arrays of oligonucleotides on solid supports has been rendered possible by the development of a 
5 technology generally identified as "Very Large Scale Immobilized Polymer Synthesis" (VlSlPS n> ) in which, typically, 
probes are immobilized in a high density array on a solid surface of a chip. Examples of VLSIPS™ technologies are 
provided in US Patents 5,143.854 and 5,412,087 and in PCT Publications WO 90/15070, WO 92/10092 and WO 
95/11995, which describe methods for forming oligonucleotide arrays through techniques such as light-directed 
synthesis techniques. In designing strategies aimed at providing arrays of nucleotides immobilized on solid supports, 

10 further presentation strategies were developed to order and display the oligonucleotide arrays on the chips in an 
attempt to maximize hybridization patterns and sequence information. Examples of such presentation strategies are 
disclosed in PCT Publications WO 94/12305, WO 94/1 1530, WO 97/29212 and WO 97/31256. 

In the following discussion, we provide methods and compositions which are used to detect the presence of 
galecttn-3 protein in a biological sample so that the predisposition to NCC can be determined. 

1 5 Detection ofgaiectm-3 expression using protein-based diagnostics 

By one approach galectin -3 expression is determined by using protein-based Diagnostics. The presence end 
amount of gatectin -3 protein in a biological sample can be detected by screening for the presence of the protein using 
conventional assays. Antibodies (monoclonal or polyclonal) immunoreactive with gatectin -3 protein can be used to 
screen biological samples such as serum, urine, ascites, cerebrospinal fluids or any other body fluid. Preferably, liver 

20 cells, extracts of liver cells, or fiver biopsy tissue, as described above, are screened for the presence and amount of 
galectin-3. Portions, fragments, or antibody derivatives are contemplated for use with several embodiments of the 
present invention. Further, phage display or galactin-3 binding assays can be used to detect the presence and amount 
of galectin-3 in a biological sample. Such immunological assays can be done in many convenient formats and are 
known to those of skill in the art. 

25 In some embodiments, antibodies reactive to gatectin -3 are used to immunoprecipitate galectin *3 protein 

from a Over cell homogenate. In others, Western or fmmunoblots of proteins isolated from liver cells are performed 
using anti-galBctm-3 antibodies. In another preferred embodiment, antibodies will detect galectin-3 in paraffin or 
frozen sections of liver tissue, using immunocytochernical techniques. Preferred embodiments relating to methods for 
detecting the presence and amount of gatectin 3 protein also include enzyme-linked imrmmosorbant assays (EUSA), 

30 radioimmunoassays (RIA), hnmunoradiometric assays (IRMA), fluorescent activated cell sorting (FACS), and 
hnmunoenzymatic assays (1EMA), including sandwich assays using monoclonal and/or polyclonal antibodies. 
Exemplary sandwich assays are described by David et al., in U.S. Patent Nos. 4,376,1 10 and 4,486,530. 

The presence of galectin-3 in a sample may also be determined using affinity chromatography columns 
having a P-galactoside bound thereto. The proteins present in a biological sample are applied to a support having a 

35 galactoside sugar {e.g., a iactosyl-SEPHAROSE resin) under conditions which allow galectin-3 to bind to the support. 
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The presence and amount of galectin-3 can be determined directly using antibodies or phage display or the bound 
galectin-3 can be eluted by using a P-galctoside sugar (e.g., 0.1M lactose). Many more methods for accessing the 
ability galectin-3 protein to bind a p-galactoside sugar are known to the art. 

In order to provide a basis for diagnosis, normal or standard values for galectin-3 expression of a subject is 

5 preferably established. This can be accomplished in many ways and one approach is to contact body fluids, cell 
extracts, or biopsy tissue from normal subjects with antibody to galectin-3 under conditions suitable for complex 
formation. The amount of antibody bound to normal fluids, cells, and tissues (e.g., liver cells and liver tissue) is 
quantified and recorded using conventional immunological techniques. This measurement can serve as a baseline to 
which biological samples from subjects having HCC or cirrhosis of the liver or likely to develop HCC or cirrhosis of the 

10 liver can be compared. Additionally, databases having measurements of galectin-3 expression of several affCcted 
individuals (similar to that shown in Table 1) are valuable standards by which the progression of liver disease, based 
on galectin-3 expression, can be monitored. In this manner, deviation between the standard and the subject values 
establishes the presence and severity of disease state. 

In the following discussion, we provide methods and compositions which are used to detect the presence of 

15 RNA encoding galectin-3 in a biological sample so that the presence of or predisposition to HCC and cirrhosis can be 
determined. 

Detection ofgatectin-3 expression using RNA-detection diagnostics 

A wide variety of labels and conjugation techniques are known by those skilled in the art and can be used in 
various nucleic acid assays. There are several ways to produce labeled nucleic acids for hybridization or PCR 

20 (Polymerase Chain Reaction) including, but not limited to, oiigolabeiing, nick translation, end-labeling, or PCR 
amplification using a labeled nucleotide. Alternatively, a desired galectin sequence, or any portion of it may be cloned 
into a vector for the production of an mRNA probe. Such vectors are known in the art, are cornmerciaUy available, and 
may bs used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as T7, T3 or SP6 
and labeled nucleotides. A number of companies such as Pharmacia Biotech (Piscataway NJ.), Promega (Madison 

25 Wis.), and U.S. Biochemical Corp (Cleveland Ohio) supply commercial kits and protocols for these procedures. Suitable 
reporter molecules or labels include those radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic 
agents as well as substrates, cof actors, inhibitors, magnetic particles and the (ike. 

For diagnostic and prognostic purposes, nucleic acid probes having a sequence complementary to a nucleic 
add encoding a galecttn may be used to detect and quantitate gene expression of the galecttn in biological samples 

30 including, but not limited to biopsied tissues or any of the biological samples discussed above. Preferably, nucleic acid 
probes which are complementary to mRNA encoding galectin-3 are used to screen for polynucleotides in liver biopsy 
tissue, Over cell extracts, or other biological samples. RNA-detection-based diagnostic assays, such as Northern 
hybridization. Northern dot blots, RNA in situ hybridization, and EUSA assays, are particularly useful to distinguish 
between the absence, presence, and excess expression of galectins (e.g., galectin-3) and to monitor regulation of 

35 galectin levels during therapeutic intervention. 
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Included in the scope of the invention are the use of oligonucleotide sequences, antisense RNA and ONA 
molecules, and PNAs which complement galectin sequences, particularly galectin-3, for the determination of galectin 
expression in liver cells and tissue by RNAbased detection techniques. These forms of polynucleotide sequences 
encoding galectin 3 may also be used for the diagnosis of conditions or diseases with which the expression of galectin- 
5 3 is associated. For example, polynucleotide sequences complementary to mRNA encoding galectin-3 may be used in 
hybridization or PCR assays of fluids or tissues from biopsies to detect galectin-3 expression and thereby identify the 
subject as having a predisposition or established case of cirrhosis of the liver and/or HCC. The form of such 
qualitative and/or quantitative methods may include northern analysis, dot blot or other membrane-based technologies; 
PCR technologies; dp stick, pin. chip and ELISA technologies. All of these techniques are well known in the art and 

1 0 are the basis of many commercially available diagnostic kits. 

In one aspect, RNA probes complementary to galectin-3 mRNA are used in assays that detect activation or 
induction associated with disease (including cirrhosis and HCC). Accordingly, the nucleotide or polypeptide sequences 
of galectin-3 (referenced in Robertson et aL, Biochemistry 29: 8093-8100 (1990) or Genbank Accession J02921, 
M57710, 4504982, or NIP §4504982, gl 79530, NMJ02306.1 and NP_002297), the disclosure of which are 

15 incorporated herein by reference in their entireties, is used to design suitable RNA probes. (See Figure 2). The RNA 
probes are labeled by methods known in the art and are added to a DNAse treated fluid or tissue sample from a 
patient under conditions suitable for the formation of hybridization complexes. Hybridization on complexes are 
isolated or the sample is treated with an agent which removes unhybridized nucleic acids. After an incubation period, 
the sample is washed with a compatible fluid which optionally contains a dye (or other label requiring a developer) if 

20 the nucleotide has been labeled with an enzyme. After the compatible fluid is rinsed off, the dye is quantitated and 
compared with a standard. If the amount of dye in the biopsied or extracted sample is significantly elevated over that 
of a comparable control sample, the nucleotide sequence has hybridized with RNA in the sample, and the presence of 
elevated levels of RNA encoding galectin-3 in the sample indicates the presence of liver disease. 

Such assays may also be used to evaluate the efficacy of a particular therapeutic treatment regime in animal 

25 studies, in clinical trials, or in monitoring the treatment of an individual patient. In order to provide a basis for the 
diagnosis of disease, a normal or standard profile for human galectin-3 expression in liver cells, extracts or tissue must 
be established. This is accomplished by combining body fluids or celt extracts taken from normal subjects with RNA 
probes encoding gatectin-3, or a portion thereof, under conditions suitable for hybridization. Standard hybridization 
may be quantified by comparing the values obtained for normal subjects with a dilution series of galectin-3 RNA run in 

30 the same experiment where a known amount of substantially purified galectin-3 RNA is used. Standard values 
obtained from normal samples are then compared with values obtained from samples from subjects thought to be 
afflicted with liver disease. Deviation between standard and subject values establishes the presence of disease. 

Additionally, PCR methods which may be used to quantitate the expression of a particular molecule include 
radiolabeling (Melby P. C. et al. J Immunol Methods 159:235-44 (1993)) or biotiny taring nucleotides (Ouplaa C. et at. 

35 Anal Biochem 212:229-236 (1993)), coamplification of a control nucleic acid, and standard curves onto which the 
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experimental results are interpolated Quantitation of multiple samples may be speeded up by running the assay in an 
EUSA format where the oligomer of interest is presented m various dilutions and a spectrophotometry or colorimetric 
response gives rapid quantitation. A definitive diagnosis of this type may allow health professionals to begin 
aggressive treatment and prevent further worsening of the condition. Similarly, further assays can be used to monitor 
the progress of a patient during treatment* 

Once disease is established, a therapeutic agent is administered and a treatment profile is generated. Such 
assays may be repeated on a regular basis to evaluate whether the values in the profile progress toward or return to 
the normal or standard pattern. Successive treatment profiles may be used to show the efficacy of treatment over a 
period of several days or several months. 

As mentioned above, PCR technology may be used to identify and quamitate expression of galectins, 
particularly galecth>3. For amplification of mRNAs, it is within the scope of the present invention to reverse 
transcribe mRNA into cONA followed by PCR (RT-PCR); or, to use a single enzyme for both steps as described in U.S. 
Patent No. 5,322,770, the disclosure of which is incorporated herein by reference in its entirety, or, to use Reverse 
Transcriptase Asymmetric Gap Ligase Chain Reaction (RT-AGICR), as described by Marshall R.L et al. \PCR Methods 
and Applications 4:80-84, 1 984), the disclosure of which is incorporated herein by reference in its entirety. 

A variety of PCR techniques are famffiar to those skilled in the art For a review of PCR technology, see 
Molecular Cloning to Genetic Engineering White, BA Ed. in Methods in Molecular Biology 67: Humana Press, Totowa 
(1997), the disclosure of which is incorporated herein by reference in its entirety and the publication entitled "PCR 
Methods and Appncattons" (1991, Cold Spring Harbor Laboratory Press), the disclosure of which is incorporated 
herein by reference in its entirety. In each of these PCR procedures, PCR primers on either side of the galectn>3 
sequence to be amplified are added to a suitably prepared nucleic add sample along with dNTPs and a thermostable 
polymerase such as Taq polymerase, Pfu polymerase, or Vent polymerase. The nucleic acid in the sample is denatured and 
the PCR primers are specifically hybridized to complementary nucleic add sequences in the sample. The hybridized primers 
are extended. Thereafter, another cyde of denaturation, hybridization, and extension is initiated. The cydes are repeated 
multiple times to produce an amplified fragment containing the nucleic acid sequence between the primer sites. PCR has 
further been described in several patents induding US Patents 4,683,195, 4,683,202 and 4,965,188, the disclosure 
of which is incorporated herein by reference in its entirety. 

The primers are selected to be substantially complementary to a portion of the sequence of galecting-3 
mRNA and a portion of the sequence complementary to the sequence of galectin-3 mRNA, thereby allowing the 
sequences between the primers to be amplified. The length of the primers for use with aspects of the present 
invention can range from 8 to 100 nucleotides, preferably from 8 to 50, 8 to 30 or more preferably 8 to 25 
nucleotides. Shorter primers tend to lack specificity for a target nucleic acid sequence and generally require cooler 
temperatures to form sufficiently stable hybrid complexes with the template. Longer primers are expensive to produce 
and can sometimes self-hybridize to form hairpin structures. The formation of stable hybrids depends on the melting 
temperature (Tm) of the DNA. The Tm depends on the length of the primer, the ionic strength of the solution and the 
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6+C content. The higher the G+C content of the primer, the higher is the melting temperature because 6:C pain are 
held by three H bonds whereas A:T pairs have only two. The G*C content of the amplification primers of the present 
invention preferably ranges between 10 and 75 %, more preferably between 35 and 60 %, and most preferably 
between 40 and 55 %. The appropriate length for primers under a particular set of assay conditions may be 

5 empirically determined by one of skill in the art 

The spacing of the primers determines the length of the segment to be amplified. In the conteit of the 
present invention amplified segments carrying nucleic acid sequence encoding a galectin can range in size from at least 
about 25 bp to 35 kbp. Amplification fragments from 25-3000 bp are typical fragments from 50-1000 bp are 
preferred and fragments from 100-600 bp are highly preferred. It will be appreciated that amplification primers for a 

10 specific galectin may be any sequence which allows the specific amplification of any DNA fragment carrying nucleic 
acid sequence unique to the particular galectin. Amplification primers may be labeled or immobilized on a solid support 
as described above. 

In thB following discussion, we provide methods and compositions which are used to detect patterns of 
galectin expression in a biological sample so that the predisposition to disease can be determined. 

1 5 Detection of patterns of galectin expression to identify predisposition to disease 

Another aspect of the present invention is a method of correlating the ratio between the expression levels of 
a plurality of galectins with a disease state comprising obtaining biological samples from individuals suffering from the 
disease and normal individuals, determining the expression levels of two or more galectins in the samples, and 
determining whether there is a statistically significant association between the ratio of galectin expression and the 

20 disease state. Statistically significant associations can be determined using statistical methods familiar to those 
skilled in the art, including chi-squared analyses. In some embodiments, the disease state is cancer. Galectin 
expression may be measured using any of the methods described above for detecting galecting-3 protein or mRNA. 
Preferably, the galectins whose expression levels are measured to determine the ratios are selected from the group 
consisting of galectins MO. Alternatively, the galectins may be galectins which are currently unknown but which are 

25 identified based on their possession of one or more of the homology regions presented in Table III. Table 111 shows 
examples of conserved galectin residues from members of different animal species; the conserved residues are boxed. 
Simflar patterns of amino acid conservation in context of each other can be used to differentiate galectin homologs in 
one animal species. Preferably, galectin expression is measured using the protein-based or nucleic acid based 
detection methods described above for gaJectin-3. 

30 The overexpression, underexpression, or. sometimes, any expression of members of the galectin family is 

associated with several disease states and most acutely cancer. Galectin- 1 and galectin-3 expression, for example, 
have been associated with tumor metastasis. (Raz et al. Cancer Res. 46:3667(1986) and Raz et al., Int. J. Cancer 
46:871 (19901. 6alectin 5 is thought to mediate cell adhesion and galectin-5H is believed to be involved with the 
onset of inflammatory disease and several forms of neoplastic disease. (Hillman et aU U.S. Pat. No. 5,837,493). The 

35 expression of gaJectin-7 has been observed to be downregulated in a mafignant keratinocyte cell line and is thought to 
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be required for the maintenance of normal keratinocytes. IHiltman et aL, U.S. Pat. No. 5,837,493). The appearance 
of galectin 8 and 9 have also been associated with the onset of cancer. In the table below, several tumors that have 
been associated with the expression of various galectins are listed. 



Association of Galectins with Neoplasms 



Table II 

Tumor Type 
Bladder carcinoma 

Anaplastic large cell 
lymphoma 

Colorectal carrinama 
Colorectal carcinoma 

Ovarian carcinoma 

Breast carcinoma 

Thyroid carcinoma 
Colorectal carcinoma 
Various 

Prostate Carcinoma 
Hodgkin's lymphoma 

Because the family of galectins share several conserved domains, the elucidation of many more galectin 
family members will be forthcoming. (See Table 3). Many more tumor markers and/or markers for other forms of 
10 disease will be found. 



Galectin 


Normal Tissue Expression 


Properties 


1 


Expressed 


Upregulated in high-grade 






tumors 


3 


Absent in T lymphocytes 


Tumor marker 


3 


Expressed 


Oownregulated in 






adenocarcinoma 


3 


Nuclear and cytoplasmic 


Conversion from nuclear 






to cytoplasmic with tumor 






progression 


3 


Expressed 


Oownregulated in 






carcinoma 


3 


Expressed 


Oownregulated in 






carcinoma 


3 


Expressed 


Tumor marker 


4 


Expressed 


Oownregulated 


7 


Expressed in stratified epithelia 


Downreguiation antigen 




Metapiasiogenic marker 


8 


Absent 


Tumor marker 


9 


Absent 


Tumor marker 



TABLE III 

cegal -~ 

eelgal - 

(Seq. ID No. 1) Chgal3 MQAMKARCWQ PHWMLPLLPL SSPLHPQLSD ALPAHNPGAP PPQGWNRPPG 

20 ragaH * *-* ~— *"* -« — * * ' 

mugaH -r — — " 

hugall — mmmm * 

hugal* - — 

25 51 

cegal — — — — ■ — .-^ - 

eelgal -*>• " "* 
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(Seq. 10 No. 2) 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



(Seq. 
(Seq. 
(Seq. 
(Seq. 
(Seq. 
(Seq. 
(Seq. 



(Seq. 
(Seq. 
(Seq, 
(Seq. 
(Seq. 
(Seq. 
(Seq. 



(Seq. 
(Seq. 
(Seq. 
(Seq. 
(Seq. 
(Seq. 
(Seq, 



(Seq. 
(Seq. 
(Seq. 
(Seq. 
(Seq. 
(Seq. 
(Seq. 



10 Ho. 
ID No. 
ID No. 
ID No. 
ID No. 
ID No. 
ID No. 



ID No. 
ID No. 
ID No. 
ID No. 
ID No. 
ID No. 
ID No. 



ID No. 
ID No. 
ID No. 
ID No. 
ID No. 
ID No. 
ID No. 



ID No. 
ID No. 
ID No. 
ID No. 
ID No. 
ID No. 
ID No. 



chga!3 
ragal4 
raugal4 
hugall 
huga!4 



3) cegal 

4) eelgal 

5) chga!3 

6) ragal4 

7) mugal4 

8) hugall 

9) huga!4 



(Seq. 
(Seq. 



ID No. 
ID No. 



PGAFPAYPGY PGAYPGAPGP YPGAPGPHHG PPGPYPGGPP GPYPGGPPGP 



101 150 

—-MSAEEP KSYP..VPYR SVLQEKF EPGQTL 

SGGLQVK NFDFTVGKFL 

YPGGPPGPYP GGPTAPYSEA PAAPLKVPYD LPLPAGLMPR LL I 

M AYVPAPGYQP TYNP.TLPYK RPIPGGL SVGMSI 

^ -PIPGGL SVGMSV 

MACGLVAS NLNLKPGECL 

M AYVPAPGYQP TYNP.TLPYY QPIPGGL NVGMSV 



10) 
11) 



cegal 
eelgal 

12) chgal3 

13) ragal4 

14) mugal4 

15) hugall 

16) huga!4 



17) cegal 

18) eelgal 

19) chga!3 

20) ragal4 

21) mugal4 

22) hugall 

23) huga!4 



24) 
25) 
26) 
27) 
28) 
29) 
30) 



(Seq. ID No. 31) 



32) 
33) 



(Seq. ID No. 34) 



cegal 
eelgal 
chgal3 
ragal4 
raugal 4 
hugall 
huga!4 



cegal 
eelgal 
Chgal3 
ragal4 
mugal4 
hugall 
huga!4 



151 

IVKGSTID. . 
TVGGFINNSP 
T1TGTVNSNP 
YIQGIAKD. . 
YIQGMAKE. . 
RVRGEV. .AP 
YIQGVASE. . 

201 

VLNSF. . .SN 
VMNSTLKGDN 
VCNSMF . . QN 
VFNTM. . .QS 
VFKTM. . .QS 
VCNS. .K.DG 
VFNTL. . .QG 

251 

EFKDYEHR.L 
NL.EFPNR.Y 
HLLQFNFREK 
PFYEYGHR. L 
SFYEYGHR.L 
EF.KFPNR.L 
PFYEYGHR.L 

301 

.YP 



(Seq. ID No. 35) cegal 
eelgal 
chgal3 

(Seq. ID No. 36) cagal4 
(Seq. ID No. 37) rougal4 
hugall 

(Seq. ID No. 38) huga!4 



(Seq. ID No. 39) cegal 
eelgal 
chgal3 

(Seq. ID No. 40) ragal4 
(Seq. ID No. 41) mugal4 
hugall 

(Seq. ID No. 42) huga!4 



ESQRFTINLH 
. .QRFSVN. . 
N . . RFSLDFK 
NMRRFHVNF. 
NMRRFHVNF. 
DAKSFVLN. . 
HMKRFFVNF. 



GEWGKEERK . 
G . WETEQRST 
N . WGKEERTA 
GQWGKEEKKK 
GQWGKEEKKK 
GAWGTEQREA 
GKWGSEERKR 



PLSSISHLSI 
SKEFLPFLSL 
KLNGITKLCI 
PLQMVTHLQV 
PVQMVTHLOV 
NLEAINYMAA 
PLQMVTHLQV 



200 

SKTADFSGND VPLHVSVRFD E.G. . .K. .1 
.VGESM..NS LSLHLDHRFN .YGAD.QNTI 

R.GQD IAFHFNPRFK E . . . DHKRVI 

AVGQD.EGAD IAFHFNPRFD . .GWD.K. .V 
AVGQD.DGAD VAFHFNPRFD . .GWD.K. .V 
. LGKDS . . NN LCLHFNPRFN AHG.D.ANTI 
WGQD.PGSD VAFHFNPRFD . .GWD.K. .V 



. SNPIKKGDS 
. N FTLSAGQY 
PRFPFEPGTP 
. SMPFQKGHH 
. SMPFQKGKH 
.VFPFQPCSV 
.SMPFKKGAA 



FDIRIRAHDD 
FEITLSYDIN 
FKLQVLCEGD 
FELVFMVMSE 
FELVFMVMPE 
AEVCITFDQA 
FELVFIVMAE 



250 

RFQI . IVDHK 
KFYIDILDGP 
HFKVAV.NDA 
HYKV.WNGT 
HYKV.WNGN 
NLTVKLPDGY 
HYKV.WNGN 

300 



DGDLYLNHV. .HW.GGK Y 

AGDARLTLV . . K.E 

AGDITLTSVL TSMI 

DGDLELQSI. .NFLGGQPAA SQYPGTMTIP 

DGDLELQSI. .NFLGGQPAA APYAGAMTIP 

DGDFKIKCV. . AFD 

OGDLQLQSI. . NFIGGQPLR PQ. .GPPMMP 

350 

V PYESGLANGL PVGKSLLVFG 



AYP..SAGYN PPQMNSLPVM AGPPIFNPPV PYVGTLQGGL TARRTIIIKG 
AYPAGSPGYN PPQMNTLPVM TGPPVFNPRV PYVGALQGGL TLPRTIUKG 

PYP..GPGHC HQQLNSLPTM EGPPTFNP.V PYFGRLQGGL TARRTIIIKG 

351 

TVEKKAKRFH VNL.LRKNGD ISFHFNPRFD EKHVIRNSLA ANEWGNEERE 



YVLPTAKNLI INFKVGSTGD IAFHMNPRIG D . CWRNSYM NGSWGSEERK 
YVLPTARNFV INFKVGSSGD IALKLNPRIG D.SWRNSFM NGSWGAEERK 

YVPPTCKSFA INFKVGSSGD IALHINPH.G NGTWRNSLL NGSWGSEEKK 

401 «0 
. GKNPFEKGV G . . FDLVIQN EEYAFQVFVN GERYISFAHR ADPHD.IAGL 



IPYNPF. .GA GQFFDLSIRC GTDRFKVFAN GQHLFOFSHR FQAFQRVDML 
VAYNPF..GP GQFFDLSIRC GMDRFKVFAN GQHLFOFSHR FQAFQMVDTL 

TTHNPF. .GP GQFFDLSIRC GLDRFKVYAN GQKLFOFAHP SRAFQRVDTL 
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451 465 

(Seq. ID No. 4 3) cegal QISGDIELSG IQIQ 

eelgal — ~— ~ — 

chgal3 

5 (Seq. 10 No. 44) ragal4 EIKGDITLSY VQI- 

(Seq. ID No. 45) mugal4 EINGDITL — 

hugall *: 

<Seq. ID No. 46) huga!4 EIQGDVTLSY VQI~ 

10 (Seq. ID No. 47) DNA Human galectin-3 

CCAGCCAACGAGCGGAAAATGGCAGACAATTTTTCGCTCCATGATGCGTTATCTGGGTCT 
GGAAACCCAAACCCTCAAGGAT GGCCTGGCGCATGGGGG AACCAG CCTGCTGGGGCAGGG 
GGCTACCOIGGGGCTTCCTATCCTGGGGCCTACCCCGGGCAGGCACCCCCAGGGGCTTAT 
CCTGGACAGX^ACCTCCAGGCGCCTACCATGGAGCACCTGGAGCTTATCCCGGAGCACCT 

15 GCACCrGGAGTCTACCCAGGGCCACCCAGCGGCCCTGGGGCCTACCCATCTTCTGGACAG 
CCAAGTGCCCCCGGAGCCTACCCTGCCACTGGCCCCTATGGCGCCCCTGCTGGGCCACTG 
ATTGTGCCTTATAACCTGCCTTTGCCTGGGGGACTGGTGCCTCGCATGCTGATAACAATT 
CTGGGCACGGTGAAGCCCAATGCAAACAGAATTGCTTTAGATTTCCAAAGAGGGAATGAT 
GTTGCCTTCCACTTTAACCCACGCTTCAATGAGAACAACAGGAGAGTCATTGTTTGCAAT 

20 ACAAAGCTGGATAATAACTGGGGAAGGGAAGAAAGAC^GTCGGTTTTCCCATTTGAAAGT 
GGGAAACCATTCAAAATACAAGTACTGGTTGAACCTGACCACTTCAAGGTTGCAGTGAAT 
GATGCTCACTTGTTGCAGTACAATCATCGGGTTAAAAAACTCAATGAAATCAGCAAACTG 
GGAATTTCTGGTG ACAT AGACCTC ACCAGT GCTT CATAT ACCAT GAT AT AATCTG AAAGG 
GGCAGArrAAAAAAAAAAAAAAAGAATCTAAACCTTACATCT^ 

25 CTGTGAGTGAAAATTTTTACATTCATCAATATCCCTCTTGTAAGTCATCTACTTAATAAA 
TATTACAGTGAAAG © 



35 



cegal 


- nematode 


eelgal 


- eel 


chgal3 


- chicken 


ragal4 


- rat 


mugaX4 


- mouse 


hugall 


- human 


huga!4 


- human 



In another aspect of the present invention, we contemplate the identification of patterns of galectin 
40 expression so as to identify a disease state. When the expression of all members of the galectin family are analyzed, a 
predisposition for various diseases can be ascertained. In particular, the complete expression pattern of the galectin 
family in all normal and tumor tissues will indicate both preneoplastic and post-neoplastic events. 

By one approach, diagnostic tools which simultaneously screen for all galectin family members are 
developed. Current technology, such as gene array scanning systems permits the execution of our strategy due to its 
45 inherent large-scale screening abilities, accuracy, generation of quantitative information and efficiency in both 
preparation of diagnostic arrays and data scanning and accumulation. Accordingly, the approaches to monitoring the 
expression of galectin-3, detailed above, (e.g., the protein and RNA detection methods) can be modified so that the 
expression of several galectins from numerous tissues or biological sample sources can be simultaneously analyzed. 
The use of traditional techniques, such as antibody-mediated serum assays, immunohistochemistry, ELISA, genechip 
50 northerns, and PCR will permit us to screen for multiple galectins rapidly. Additionally, the quantity of each galectin is 
preferably determined. The amount of RNA or protein corresponding to the various galectins can be quantified by the 
approaches detailed above or by many other methods known to those of skill in the art. For example, the total protein 
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or RNA in a sample is first determined by conventional techniques (e.g., by spectroscopy), and then by using a probe 
having a label with a known specific activity, the amount of galectin expression can be accurately determined and 
normalized to a "house keeping marker." 

Once the levels of the various galectins are determined, the information is recorded onto a computer readable 
5 media, such as a hard drive, floppy disk, DVD drive, zip drive, etc.. After recording and the generation of a database 
comprising the levels of expression of the various galectins studied, a comparing program is used which compares the 
levels of expression of the various galectins so as to create a ratio of expression, in a first comparison, a galectin to 
galectin ratio is generate! For example, desirable galectin to galectin ratios include, but are not limited to: galectin* 
1 :galectin-3. galectin- 1 :galectin-5, galectin- 1 :galec tin- 7. galectin- 1 :galectin-8, galectin- 1 :ga!ectin-9, galectin- 

10 3:galectin-5, galectin-3:galectin-7, galecth>3:galectirv8, galectin-3:galectin-9. gaIectin5:gaIectin-7. galectin- 
5:galectin-8, galectin-5:galectin-9, galectin-7:galectin-8. galectin-7:galectm-9. and galectin-8:galectin-9. As other 
newly found galectins are identified, we contemplate screening for their expression patterns and incorporating the 
levels of expression into our ratio analysis approach to disease prognosis. In a second comparison, the galectin to 
galectin ratios from normal subjects are compared to galectin to galectin ratios of subjects having various diseases. 

15 Preferably, the diseased subjects studied initially have been identified as having a form of cancer. Desirably, several 
databases ere generated comprising the galectin to galectin ratios from normal individuals and the galectin to galectin 
ratios from diseased subjects so that a statistical analysis can be accurately performed In this manner patterns of 
galectin expression are analyzed and the predisposition to galectin related disease is determined. 

Individuals who wish to be screened for a predisposition for a galectuwelated disease can have a biological 

20 sample taken, and the RNA or protein present in the sample anlayzed for the galectin expression pattern, and the 
galectin to galectin ratios are compared to galectin to galectin ratios exhibited by both normal and diseased subjects. 
Depending on the comparison of the normal and diseased ratios, a helath care practitioner can choose an appropriate 
course of treatment and/or prevention. 

In the following discussion, we provide methods and compositions which are used to treat and/or prevent 

25 diseases associated with increased levels of galectin expression. 
Therapeutic and prophylactic agents 

In addition to the diagnosis of liver disease and galectin-3 related diseases, galectin-3 -specific antibodies are 
useful for the treatment of contfitions and diseases associated with expression of galectin-3. Such antibodies include, 
but are not limited to, polyclonal, monoclonal, chimeric, single chain, Fab fragments and fragments produced by a Fab 

30 expression library. Neutralizing antibodies, i.e., those which inhibit dimer formation, are especially preferred for 
diagnostics and therapeutics. 

Galectin-3 protein to be used for antibody production need not retain biological activity; however, the 
protein fragment, or oligopeptide is desirably antigenic. Peptides used to induce specific antibodies may have an 
amino acid sequence consisting of at least five amino acids, preferably at least 10 amino adds. Preferably, they 

35 should mimic a portion of the amino acid sequence of the natural protein and may contain the entire amino acid 
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sequence of a small naturally occurring molecule. Short stretches of gafecttn-3 amino acids may be fused with those 
of another protein such as keyhole limpet hemocyanin and antibody produced against the chimeric molecule. 

For the production of antibodies, various hosts including goats, rabbits, rats. mice, etc may be immunized by 
injection with galectin-3 or any portion, fragment or oligopeptide which retains immunogenic properties. Depending on 
5 the host species, various adjuvants may be used to increase immunological response. Such adjuvants include but are 
not limited to Freund's, mineral gels such as aluminum hydroxide, and surface active substances such as rysotecithin, 
pturonic polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanin. and dinrtrophenoL BCG (Bacillus 
Calmette-Guerin) and Corynebacterium parvum are potentially useful adjuvants. 

Monoclonal antibodies to galectin-3 may be prepared using any technique which provides for the production 
10 of antibody molecules by continuous celJ lines in culture. These include but are not limited to the hybridoma technique 
originally described by Koehler and Mtlstein (Nature 256:495497 (1975), the human B-cell hybridoma technique 
(Kosbor et aL Immunol Today 4:72 (1983); Cote et al Proc Natl Acad Sci 80:2026-2030 (1983). and the EBV- 
hybridoma technique Cole et at. Monoclonal Antibodies and Cancer Therapy, Alan R. Liss Inc, New York N.Y.. pp 77-96 
(1985). 

15 In addition, techniques developed for the production of "chimeric antibodies*, the splicing of mouse antibody 

genes to human antibody genes to obtain a molecule with appropriate antigen specificity and biological activity can be 
used. (Morrison et aJ. Proc Natl Acad Sci 81:6851-6855 (1984); Neuberger et aJ. Nature 312:604-608(1984); 
Takeda et al. Nature 314:452454(1985). Alternatively, techniques described for the production of single chain 
antibodies (U.S. Pat. No. 4,946.778) can be adapted to produce galectin-3 -specific single chain antibodies. 

20 Antibodies may also be produced by inducing in vivo production in the lymphocyte population or by screening 

recombinant immunoglobulin libraries or panels of highly specific binding reagents as disclosed in Orlandi et aL, Proc 
Natl Acad Sci 86: 3833-3837 (1989), and Winter 6. and Milstein C; Nature 349:293-299 11991). 

Antibody fragments which contain specific binding sites for galectin-3 may also be generated. For example, 
such fragments include, but are not limited to, the Flab*), fragments which can be produced by pepsin digestion of the 

25 antibody molecule and the Fab fragments which can be generated by reducing the disulfide bridges of the F(ab'), 
fragments. Alternatively. Fab expression libraries may be constructed to allow rapid and easy identification of 
monoclonal Fab fragments with the desired specificity Huse W. 0. et al. Science 256:1275-1281 (1989). 

By "antibody therapy" is meant the administration of an antibody, an antibody conjugate or an antibody 
heteroconjugate to a subject in need for the purpose of treating or preventing a disease. Many forms of antibody 

30 therapy are known in the art including, but not limited to, the administration of monoclonal or humanized monoclonal 
antibodies, the administration of toxin or radionuclide conjugated antibodies, and the administration of monoclonal 
antibody heteroconjugates having one domain that binds to a disease-related antigen (e.g., a gaiectin such as galectin- 
3) and another domain that binds to the Fc region of IgG. Further, the term antibody therapy is meant to include the 
administration of (Fabl, or Fab fragments with or without conjugated toxins or radionuclides. Several examples of 
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antibody therapy are found in Recent Results in Cancer Research vol 14? .-Systemic Radiotherapy with Monoclonal 
Antibodies* edited by Ml. Sautter Bihl and M. Wannenmacher, Sprmger-Veriag pubBshers, 1996. 

Radiolabeled mAbs specific for a tumor antigen (e.g., a galectin such as galectin-3) are prepared by labeling 
the antibody with an isotope or combinations of isotopes, such as ,31 l. "Y. "Cu, ,M Re, ,M Re, 2,2 Bi or m At. Preferable 
5 radiolabeled mAbs are able to deliver more than 6000 rads to the tumor and have sufficient affinity so that the 
patient's bone marrow is not exposed to more than 300 rads. We contemplate the use of m l labeled anti-galectin-3, 
as well as other radiolabeled mAbs for treatment of HCC and cirrhosis of the liver. In this aspect and in several other 
embodiments using radionuclide or toxin conjugated antibodies, a delivery system which preferentially allows delivery 
of the antibody therapy agent to diseased ceJJs is desired. By one approach, a liposome having the antibody therapy 

10 agent and a membrane bound protein which interacts specifically with liver ceils is used. Additionally, the located 
delivery of the antibody therapy agent (e.g., injecting the agent directly into Hver tissue) is contemplated. 

Further, the development of peptidomimetics and other compounds which interfere with galectin-3 function 
so as to generate pharmaceuticals which can be used to treat galectin-3 related liver diseases are embodiments of the 
invention. Galectin-3, its catalytic or immunogenic fragments or oligopeptides thereof, can be used for screening 

15 therapeutic compounds in any of a variety of drug screening techniques. The fragment employed in such a test may 
be free in solution, affixed to a solid support, borne on a ceil surface, or located intraceltularly. The formation of 
binding complexes, between human galectin-3 and the agent being tested, may be measured. Another technique for 
drug screening which may be used for high throughput screening of compounds having suitable binding affinity to 
galectin-3 is described in detaO in "Determination of Amino Acid Sequence Antigenicity- by Geysen H. N., WO 

20 Application 84/03564, published on Sep. 13, 1984, and incorporated herein by reference. In summary, large numbers 
of different small peptide test compounds are synthesized on a solid substrate, such as plastic pins or some other 
surface. The peptide test compounds are reacted with fragments galectin-3 and washed. Bound galectin-3 is then 
detected by methods well known in the art. Substantially purified galectin-3 can also be coated directly onto plates 
for use in the aforementioned drug screening techniques. Alternatively, non-neutralizing antibodies can be used to 

25 capture the peptide and immobilize it on a solid support or many of the approaches for attaching a biomoiecule to a 
support, described above, can be used. Once test compounds are found, they ere preferably tested in a test subject 
having liver disease such as HCC and cirrhosis of the liver and the effect on galectin-3 activity is analyzed by one or 
more of the approaches detailed above. Successful agents are then incorporated into pharmaceuticals and are used to 
treat and/or prevent liver diseases such as HCC and cirrhosis of the liver. 

30 Additionally, we contemplate the use of antisense and ribozyme technology to inhibit the expression of galectin-3 

and/or other galectins. The antisense nucleic acid molecules to be used in gene therapy may be either DNA or RNA 
sequences. They may comprise a sequence complementary to the sequence of mRNA encoding galectin-3. The antisense 
nucleic acids should have a length and malting temperature sufficient to permit formation of an intracellular duplex having 
sufficient stabifity to inhibit the expression of the mRNA in the duplex. Strategies for designing antisense nucleic acids 
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suitable for use in gene therapy are disclosed in Green et aL. Ann. /lev. Biochem. 55:569-597 (1986) and tzant and 
Weintraub, Ceff, 36:1007-1015 (1984). 

In some strategies, antisense molecules are obtained from a nucleotide sequence encoding galectin-3 by reversing 
the orientation of the coding region with respect to a promoter so as to transcribe the opposite strand from that which is 

5 normally transcribed in the cell Antisense molecules and ribozymes may be prepared by any method known in the art 
for the synthesis of RNA molecules. These include techniques for chemically synthesizing oligonucleotides such as 
solid phase phosphoramidite chemical synthesis. Alternatively, RNA molecules may be generated by in vitro and in 
vivo transcription of ONA sequences encoding galectin-3 or another galectin. Such DNA sequences may be 
incorporated into a wide variety of vectors with suitable RNA polymerase promoters such as T7 or SP6. 

10 Alternatively, antisense cONA constructs that synthesize antisense RNA consthutively or inducibly can be introduced 
into cell lines, cells or tissues. 

Alternatively, ofigonudeotides which are complementary to the mRNA encoding galectin-3 may be synthesized in 
v'rtro. Thus, antisense nucleic acids are capable of hybridizing to the galectin-3 mRNA to create a duplex. In some 
embodiments, the antisense sequences may contain modified sugar phosphate backbones to increase stabiity and make 

15 them less sensitive to RNase activity. Possible modifications include, but are not limited to, the addition of flanking 
sequences at the 5' and/or 3* ends of the molecule or the use of phosphorothioate or T 0-methyl rather than 
phosphodiesterase linkages within the backbone of the molecule. This concept is inherent in the production of PNAs 
and can be extended in all of these molecules by the inclusion of nonttaditional bases such as inosine, queosine and 
wybutosine as well as acetyl-, methyl-, thio- and similarly modified forms of adenine, cytidine, guanine, thymine, and 

20 uridine which are not as easily recognized by endogenous endonudeases. Further examples ere described by Rossi et 
aL, Pharmacol Tto., 50121:245-254. (1991). 

Various types of antisense oligonucleotides complementary to the galectin-3 mRNA may be used. In one 
preferred embodiment, stable and semi-stable antisense oligonucleotides described in International Application No. PCT 
WO94/23026. hereby incorporated by reference, are used. In these moleucles, the 3 end or both the 3 and 5 ends are 

25 engaged in intramolecular hydrogen bonding between complementary base pairs. These molecules are better able to 
withstand exonudease attacks and exhibit increased stability compared to conventional antisense oligonucleotides. In 
another preferred embodiment, the antisense ofigodeoxy nucleotides described in International Application No. WO 
95/04141 aroused. 

In yet another preferred embodiment, the covalentry cross-linked antisense oligonucleotides described in 
30 International Application No. WO 96/31523 are used. These double- or single-stranded oligonucleotides comprise one or 
more, respectively, inter- or intra-oligonucieotide covalent cross-linkages, wherein the linkage consists of an amide bond 
between a primary amine group of one strand and a carboxyl group of the other strand or of the same strand, respectively, 
the primary amine group being directly substituted in the 2' position of the strand nucleotide monosaccharide ring, and the 
carboxyl group being carried by an afiphatic spacer group substituted on a nucleotide or nucleotide analog of the other 
35 strand or the same strand, respectively. 
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The antisense oltgodeoxynucleotides and oligonucleotides cfisclosed in International Appfication No. WO 
92/18522 may also be used. These molecules are stable to degradation and contain at least one transcription control 
recognition sequence which binds to control proteins and are effective as decoys therefor. These molecules may contain 
"hairpin" structures, "dumbbell" structures, "modified dumbbell" structures, "cross-linked" decoy structures and "loop" 
5 structures. In another preferred embodiment, the cyclic double-stranded oligonucleotides described in European Patent 
Application No. 0 572 287 A2 are used. These figated oligonucleotide "dumbbells" contain the binding site for a 
transcription factor and inhibit expression of the gene under control of the transcription factor by sequestering the factor. 
Use of the dosed antisense oligonucleotides dsdosed in International Application No. WO 92/19732, hereby incorporated 
by reference, is also contemplated. Because these molecules have no free ends, they are more resistant to degradation by 

10 exonucleases than are conventional oligonucleotides. These oligonucleotides may be multifunctional interacting with 
several regions which are not adjacent to the target mRNA. 

The appropriate level of antisense nucleic acids required to inhibit gene expression may be determined using in 
vitro expression analysis. The antisense molecule may be introduced into the ceOs expressing galectin-3 by diffusion, 
injection, infection or transection using procedures known in the art For example, the antisense nucleic acids can be 

1 5 introduced into the body as a bare or naked oBgonucleotide, oligonucleotide encapsulated in lipid, oligonucleotide sequence 
encapstdated by viral protein, or as an oligonucleotide operabfy linked to a promoter contained in an expression vector. The 
expression vector may be any of a variety of expression vectors known in the art including retroviral or viral vectors, 
vectors capable of exttachromosomal replication, or Integrating vectors. The vectors may be DNA or RNA. 

The antisense molecules are introduced onto ceil samples at a number of different concentrations preferably 

20 between 1x1tt ,0 Mto IxWM. Once the iwiwnum concentration that can alequatelycorrtrolger® expression is idenn^ 
the optimized dose is translated into a dosage suitable for use in vivo. For example, an Inhibiting concentration in culture of 
IxlO 7 translates into a dose of approximately 0.6 mg/kg bodywetght Levels of oligonucleotide approaching 100 mg/kg 
bodywehjht or higher may be possible after testing the toxicity of the oligonucleotide in laboratory animals. It is 
additionally contemplated that cells from a vertebrate, such as a mammal or human, are removed, treated with the 

25 antisense ofigonucfeotide, and reintroduced into the vertebrate. 

Ribozymes may also be used to reduce or eliminate galectin-3 expression. Ribozymes are enzymatic RNA 
molecules capable of catalyzing the specific cleavage of RNA. The mechanism of ribozyme action involves sequence- 
specific hybridization of the ribozyme molecule to complementary target RNA, followed by endonucleolytic cleavage. 
Within the scope of aspects of the invention, are engineered hammerhead motif ribozyme molecules that can 

30 specifically and efficiently catalyze endonucleolytic cleavage of a sequence encoding human galectin-3, for example. 
Specific ribozyme cleavage sites within any potential RNA target are initially identified by scanning the target 
molecule for ribozyme cleavage sites which include the following sequences, GUA, GUU and GUC. Once identified, 
short RNA sequences of between 15 and 20 ribonucleotides corresponding to the region of the target gene containing 
the cleavage site may be evaluated for secondary structural features which may render the oligonucleotide inoperable. 

35 The suitability of candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
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oligonucleotides using ribonuclease protection assays. Delivery of antisense and ribozyme agents by transfoction and 
by liposome are quite well known in the art. 

The pharmacologically active compounds of this invention can be processed in accordance with conventional 
methods of galenic pharmacy to produce medicinal agents for administration to patients, eg- mammals inchjding 
humans. The antibodies, fragments thereof, peptidomimetics, antisense, and ribozymes may be incorporated into a 
pharmaceutical product with andlor without modification. Further, we envision the manufacture of pharmaceuticals 
or therapeutic agents which deliver ami-galectin-3 agents or a nucleic acid sequence encoding the anti-galectin-3 
agent by several routes. For example, and not by way of fimitation, the use of DNA, RNA, and viral vectors having 
sequence encoding an antibody to gaJectuv3 is contemplated. Nucleic acids encoding an antibody to galectin-3 can be 
administered alone or in combination with peptides or an antibody to galectin-3. 

The compounds of this invention can be employed in admixture with conventional excipients, le., 
pharmaceutical^ acceptable organic or inorganic carrier substances suitable for parenteral, enteral (e.g., oral) oj 
topical application which do not deleteriously react with the active compounds. Suitable pharmaceutical* acceptable 
carriers include but are not limited to water, salt solutions, alcohols, gum arabic, vegetable oils, benzyl alcohols, 
polyetylene glycols, gelatine, carbohydrates such as lactose, amylose or starch, magnesium stearate, talc, silicic acid, 
viscous paraffin, perfume oil. fatty acid monoglycerides and digh/cerides, pentaerythritol fatty acid esters, hydroxy 
methylcellulose, polyvinyl pyrrolidone. etc The pharmaceutical preoptions can be sterilized and if desired mixed 
with auxiliary agents. e.g., lubricants, preservatives, stabilizers, wetting agents, emulsifiers. salts for influencing 
osmotic pressure, buffers, coloring, flavoring and/or aromatic substances and the fake which do not deleteriously react 
with the active compounds. They can also be combined where desired with other active agents, e.g., vitamins. 

For parenteral application, particularly suitable are injectable, sterile solutions, preferably oily or aqueous 
solutions, as well as suspensions, emulsions, or implants, including suppositories. Ampoules are convenient unit 
dosages. 

For enteral application, particularly suitable are tablets, dragees, liquids, drops, suppositories, or capsules. A 
syrup, elixir, ox the like can be used wherein a sweetened vehicle is employed. 

Sustained or directed release compositions can be formulated, e.g- liposomes or those wherein the active 
compound is protected with differentially degradable coatings, e.g.. by microencapsulation, multiple coatings, etc It 
is also possible to freeze-dry the new compounds and use the lyophilizates obtained, for example, for the preparation 
of products for injection. 

For topical application, there are employed as non-sprayable forms, viscous to semi-soDd or solid forms 
comprising a carrier compatible with topical application and having a dynamic viscosity preferably greater than water. 
Suitable formulations include but are not limited to solutions, suspensions, emulsions, creams, ointments, powders, 
liniments, salves, aerosols, etc., which are, if desired, sterilized or mixed with auxiUary agents, e.g.. preservatives, 
stabilizers, wetting agents, buffers or salts for influencing osmotic pressure, etc. For topical application, also suitable 
are sprayable aerosol preparations wherein the active ingrecrent preferably in combination with a solid or liquid inert 
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carrier material is packaged in a squeeze bottle or in admixture with a pressurized volatile, normally gaseous 
propellant. e.g., a freon. 

It wfll be appreciated that the actual preferred amounts of active compound in a specific case will vary 
according to the specific compound being utilized, the particular compositions formulated, the mode of application, and 

5 the particular situs and organism being treated. Pharmaceutical compositions suitable for use in the present invention 
include compositions wherein the active ingredients are contained m an effective amount to achieve the intended 
purpose. Dosages for a given host can be determined using conventional considerations, e.g., by customary 
comparison of the differential activities of the subject compounds and of a known agent. e.g.. by means of an 
appropriate, conventional pharmacological protocol. The determination of an effective dose is well within the 

10 capability of those skilled in the art. 

For any compound, the therapeutically effective dose C8n be estimated initially either in cell culture assays 
or in animal models, usually mice, rabbits, dogs, or pigs. The animal model is also used to achieve a desirable 
concentration range and route of administration. Such information can then be used to determine useful doses and 
routes for administration in humans. 

15 a therapeutically effective dose refers to that amount of protein or its antibodies, antagonists, or inhibitors 

which ameliorate the symptoms or condition. Therapeutic efficacy and toxicity of such compounds can be determined 
by standard pharmaceutical procedures in cell cultures or experimental animals, e.g., £050 (the dose therapeutically 
effective in 50% of the population) and LD50 (the dose lethal to 50% of the population). The dose ratio of toxic to 
therapeutic effects is the therapeutic index, and it can be expressed as the ratio, LO50/E050. Pharmaceutical 

20 compositions which exhibit large therapeutic indices are preferred. The data obtained from cefl culture assays and 
animal studies is used in formulating a range of dosage for human use. The dosage of such compounds lies preferably 
within a range of circulating concentrations that include the E050 with little or no toxicity. The dosage varies within 
this range depending upon the dosage form employed, sensitivity of the patient, and the route of administration. 

The exact dosage is chosen by the individual physician in view of the patient to be treated. Dosage and 

25 administration are adjusted to provide sufficient levels of the active moiety or to maintain the desired effect. 
Additional factors which may be taken into account include the severity of the disease state, e.g., tumor size and 
location; age, weight and gender of the patient; diet, time and frequency of administration, drug combinations), 
reaction sensitivities, and tolerance/response to therapy. Long acting pharmaceutical compositions might be 
administered every 3 to 4 days, every week, or once every two weeks depending on half-Bf e and clearance rate of the 

30 particular formulation. 

Normal dosage amounts may vary from 0.1 to 100.000 micrograms, up to a total dose of about 10 g, 
depending upon the route of administration. Guidance as to particular dosages and methods of delivery is provided in 
the literature. See U.S. Pat. Nos. 4.657,760; 5.206,344; or 5,225,212. Those skilled in the art wfll employ different 
formulations for nucleotides than for proteins or their inhibitors. Similarly, delivery of polynucleotides or polypeptides 

35 will be specific to particular cells, conditions, locations, etc 
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Although the invention has been described with reference to embodiments and examples, it should be 
understood that various modifications can be made without departing from the spirit of the invention. Accordingly, 
the invention is limited only by the following claims. All references cited herein are hereby expressly incorporated by 
reference. 
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WHAT IS CLAIMED IS: 

1 . A method of identifying the presence of galectin-3 in cirrhotic or cancerous liver comprising: 

obtaining a sample of cirrhotic or cancerous liver tissue; 

providing an antibody to galectin-3; 
5 contacting the sample with the antibody; and 

analyzing the presence of galectin-3 in the sample. 
1 The method of Claim 1, wherein the cancerous liver tissue is hepatocellular carcinoma. 

3. The method of Claim 1, wherein the analysis of the presence of galectin-3 in the sample comprises 
a technique of the group consisting of fluorescence activated ceil sorting (FACS), immunoprecipitation, Western blot 

1 0 antibody staining, and a galectin-3 binding assay. 

4. A method of identifying the amount galectin-3 expression in cirrhotic or cancerous liver comprising: 
obtaining a sample of cirrhotic or cancerous liver tissue; 

providing a probe which interacts with galectin-3 or an RNA encoding galectin-3; 
contacting the sample with the probe; and 
15 analyzing the presence of galectin-3 in the sample. 

5. The method of Claim 4, wherein the cancerous fiver tissue is hepatocellular carcinoma. 

6. The method of Claim 4, wherein the probe is a member of the group consisting of a nucleic acid, a 
protein, a carbohydrate chemical agent, and a peptfdomimetk. 

7. The method of Claim 4, wherein the analysts of the presence of galectin-3 in the sample comprises 
20 a technique of the group consisting of fluorescence activated cell sorting (FACS), immunoprecipitation. Western blot, 

antibody staining, a galectin-3 binding assay, and a hybridization assay. 

8. The method of Claim 4 further comprising quantifying the amount of galectin-3 protein or RNA in 
the sample. 

9. The method of Claim 4 or 8 further comprising recording the amount of galectin-3 expression or the 
25 amount of galectin-3 on a computer readable media. 

1 0. A method of identifying a subject in need of treatment or prevention of liver disease comprising: 
obtaining a biological sample from a subject having RNA or protein, wherein the biological sample is 

obtained from the subject's liver; 

providing a probe which interacts with galectin-3 protein or RNA encoding galectm-3; 
30 contacting the biological sample with the probe under conditions which aQow the probe to interact 

with the RNA or protein in the biological sample; 

detecting the amount or probe which interacts with the RNA or protein in the biological sample so 
as to determine the amount of galectin-3 expression; 

identifying the subject as a subject in need of treatmem or prevention of fiver disease by the 
35 presence of the probe in hepatocytes. 
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11. The method of Claim 10, wherein the liver disease is cirrhosis or hepatocellular carcinoma, or any 
neoplasm originating from the liver. 

1 Z The method of Claim 1 0, wherein the biological sample comprises an hepatocyte. 

1 3. The method of Claim 10, wherein the probe is a member of the group consisting of a nucleic acid, a 
5 protein, a carbohydrate chemical agent, and a peptidomhnetic. 

H. The method of Claim 10, wherein the detection of the amount of probe which interacts with the 
RNA or protein comprises a technique of the group consisting of fluorescence activated cell sorting (FACS), 
immunoprecipitation. Western blot antibody staining, a galectin-3 binding assay, and a hybridization assay. 

15. The method of Claim 10 further comprising quantifying the amount of galectin-3 protein or RNA in 
10 the sample. 

16. The method of Claim 10 or 15 further comprising recording the amount of galectm-3 expression on 
a computer readable media. 

1 7. A method of identifying a predisposition to disease comprising; 

obtaining a biological sample from a subject, wherein the biological sample has protein or RNA; 
15 providing a plurality of probes which interact with a plurality of galectin proteins or a plurality of 

RNA sequences encoding a plurality of galectin proteins; 

contacting the protein or RNA from the biological sample with the plurality of probes under 
conditions which allow the probes to interact with galectins which may be present in the biological sample; 
detecting the probes which interact with the RNA or protein in the sample so as to determine a 
20 level of expression of more than one galectin; and 

analyzing the level of expression of more than one galectin to identify the predisposition for 

disease. 

18. The method of Claim 17, wherein the plurality of probes interact with at least two proteins of the 
group consisting of galectin* 1, galectin*2, galectin-3, galectin-4, galectin-5, galectin-5H, galectin-6, galectin-7, 

25 galectin-8, galectin-9. and galectin-10 or two RNAs having the entire coding sequence or a sequence within the entire 
coding sequence of an RNA encoding a member of the group consisting of gaiectin-1, galectin-3, galectin-4, gaiectin-5, 
galectin-5H, galectin-6, galectin-7, galectin-8, gatectin-9, and gatectin-10. 

19. The method of Claim 17, wherein the disease is cirrhosis or hepatocellular carcinoma, or any 
neoplasm originating from the liver. 

30 20. The method of Claim 1 7, wherein the biological sample comprises an hepatocyte. 

21. The method of Claim 17, wherein the plurality of probes have at least two members of the group 
consisting of a nucleic acid, a protein, a carbohydrate chemical agent, and a peptidomimetic. 

22. The method of Claim 17, wherein the detection of the amount of probe which interacts with the 
RNA or protein comprises a technique of the group consisting of fluorescence activated cell sorting (FACS), 

35 immunoprecipitation, Western blot, antfliody staining, a galectin binning assay, and a hybridization assay. 
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23. The method of Claim 17 further comprising quantifying the amount of galectin protein or RNA in 
the sample. 

24. The method of Claim 17 further comprising calculating a ratio of galectin expression, wherein the 
ratio comprises the level of expression of a first galectin compared to the level of expression of a second galectin or 

5 the ratio comprises the level of expression of a galectin from a biological sample of a first subject compared to the 
level of expression of a galectin from a biological sample from a second subject 

25. The method of Claim 17. 23, or 24 further comprising recording the amount of galectin expression 

on a computer readable media. 

26. A method of making a pharmaceutical for the treatment or prevention of hepatocellular carcinoma 
10 comprising incorporating an agent which inhibits the prevalence of gaiectin-3 in liver cells or tissue in a pharmaceutical 

formulation, wherein the agent is a member of the group consisting of an antibody to galectin-3. a peptide, a 
carbohydrate, a chemical agent a peptidomimetic which interacts with galectin-3. an anttsense oligonucleotide 
complementary to a transcript encoding galectin-3. and a riboiyme complementary to a transcript encoding galectin-3. 

27. A method of treatment or prevention of hepatocellular carcinoma or cirrhosis of the Over 
15 comprising administering an agent which inhibits the prevalence of galectin-3 in liver cells or tissue in a 

pharmaceutical formulation, wherein the agent is a member of the group consisting of an antibody to galectin-3, a 
carbohydrate, a chemical agent a peptide a peptidomimetic which interacts with galectin-3, an anttsense 
oligonucleotide complementary to a transcript encoding galectin-3, and a ribozyme complementary to a transcript 
encoding galectin-3. 

20 28. The method of Claims 26 or 27, wherein the agent is an antibody conjugated to a toxin or a 

radionuclide. 

29. A method of screening for prognosis of cirrhosis or neoplasm originating from fiver in a human 
comprising: 

(a) isolating test proteins or RNA from a biological sample having hepatocytes, said 
25 biological sample being obtained from said subject; and 

(b) determining the level of galectin-3 protein or an RNA encoding galectin-3, wherein an 
elevated level of said galectin-3 protein or RNA encoding galectin-3 indicates a positive correlation with 
development of cancer or cirrhosis of the liver, and wherein the step of determining the level of galectin-3 
protein or an RNA encoding galectin-3 is carried out by probing said galectin-3 protein or RNA encocfing 

30 gatectin-3 with an antibody which recognizes galectin-3 protein or a nucleic acid complementary to the RNA 

encoding galectin-3. 

30. A method of screening for risk of cancer or cirrhosis of the liver in a human subject comprising: 

(a) isolating test proteins or RNA from a biological sample having hepatocytes, said 
biological sample being obtained from said subject: 



-29- 



WO 00/62076 



PCTAJS00/08561 



(b) determining the level of gatectin-3 protein or an RNA encoding galectin-3, wherein an 
elevated level of said gatectin-3 protein or RNA encoding galectin-3 indicates a positive correlation with 
development of cancer or ctrrhosis of the liver, and wherein the step of determining the level of galectin-3 
protein or an RNA encoding galectin-3 is carried out by probing said galectin-3 protein or RNA encoding 

5 galectin-3 with an antibody which recognizes galectin-3 protein or a nucleic acid complementary to the RNA 

encoding gatectin-3; and 

(c) identifying the human subject as being at risk based on the level of galectin-3 protein or RNA 
encoding galectin-3 determined in step (b). 

31. The method of Claim 29 or 30 wherein said cancer is hepatocellular carcinoma or any neoplasm of 
10 liver origin. 

32. A method for determining whether an individual is suffering from hepatocellular carcinoma or 
cirrhosis of the liver or is likely to suffer to suffer from hepatocellular carcinoma or any neoplasm of liver origin or 
ctrrhosis of the fiver in the future comprising determining whether the level of galectin-3 in the liver of said individual is 
above normal. 

15 31 The method of Claim 32, wherein the step of determining whether the level of galectin-3 

expression in the liver of said individual is above normal comprises determining whether the level of galectin-3 protein 
in a biological sample obtained from the liver of said individual is above normal 

34. The method of Claim 33, wherein the step of determining whether the level of galectin-3 protein in 
a biological sample obtained from the liver of said individual is above normal comprises: 

20 contacting said biological sample with an antibody capable of specifically binding to galectin-3; and 

determining whether the level of antibody binding to said sample is greater than the level of 
antibody binding observed in samples from individuals who are not suffering from hepatocellular carcinoma 
or cirrhosis of the liver. 

35. The method of Claim 34, wherein the biological sample is a liver biopsy. 

25 36. The method of Claim 35, wherein the determining step comprises determining whether the level of 

antibody binding to hepatocytes in the liver biopsy is greater than the level observed in biopsies from individuals not 
suffering from hepatocellular carcinoma or cirrhosis of the liver. 

37. The method of Claim 32, wherein the step of determining the whether the level of galectin-3 
expression in the fiver of said individual is above normal comprises determining the level of mRNA encoding galectin-3 

30 present in a biological sample obtained from the liver of said individual. 

38. The method of Claim 37, wherein the level of galectin-3 mRNA is determined by a method from the 
group consisting of a nucleic acid amplification reaction, a nucleic acid hybridization assay, and an RNAse protection 
assay. 

39. A method for correlating the ratio between the expression level of two or more galecttns with a 
35 disease state comprising: 
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obtaining biological samples from individuals suffering from said disease state and normal 
individuals; 

measuring the expression levels of said two or more galectins in said biological samples; 
calculating a ratio between said expression levels in said samples; 
5 and determining whether there is a statistically significant association of said ratio with said 

disease state. 

40. The method of Claim 39. wherein said galectins are selected from the group consisting of galeetin- 
1, galectin-2. ga!ectin-3, gafectin-4, galectin-5, galectm 5H, galectin-7. galecu'n-8, galectto-9, and gafecttn 10. 

41. A method of determining whether an individual is suffering from or is likely to suffer from a msease 
10 in the future comprising: 

obtaining a biological sample from said individual; 

determining the expression levels of at least two galectins in said biological sample; 
calculating the ratio between the expression levels of said galectins; and 
determining whether said ratio is a ratio which is statistically correlated with a disease or a 
1 5 predisposition to a disease. 

42. The method of Claim 41, wherein said galectins are selected from the group consisting of galectin- 
1, galectin-2, galectin-3, gaJectin-4, galectin-5, galectm 5H, galectin-7, gaJectin-8, galectin-9, and galectin-10. 

43. A kit for determining galeetin expression in hepatocellular cells comprising: 
hepatocellular ceils; and 

20 a probe indicative of galeetin expression in hepatocellular cells. 

44. A reaction mixture comprising: 
hepatocellular cells of a patient; and 

a probe indicative of galeetin expression in hepatocellular cells. 

45. The method of Claims 10. 17, 29, 30, and 41 , wherein the biological sample is a body fluid. 

25 46. A method of detennining prognosis of cirrhosis or neoplasm originating from the fiver comprising: 

(a) obtaining a biological sample; 

(b) measuring the level of expression of two or more galectins; 

(c) Calculating a ratio based on the level of expression of the galectins in step (b); end 

(d) determining the prognosis of cirrhosis or neoplasm by identifying a ratio calculated in 
30 step (c) which is above normal levels of galeetin expression. 

47. The method of Claim 46. wherein the galectins measured m step {b) are at least two members of 
the group consisting of galectin-1, galectin-2, galectin-3, galectM, galectin-5, galeetin 5H. galectin-7. gatectin-8. 
galectin-9, and galectin-1 0. 
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FIGURE 2 



Human galectin-3 



1 


CCAGCCAACG 


AGCGGAAAAT 


GGCAGACAAT 


TTTTCGCTCC 


ATGATGCGTT 


51 


ATCTGGGTCT 


GGAAACCCAA 


ACCCTCAAGG 


ATGGCCTGGC 


GCATGGGGGA 


101 


ACCAGCCTGC 


TGGGGCAGGG 


GGCTACCCAG 


GGGCTTCCTA 


TCCTGGGGCC 


151 


TACCCCGGGC 


AGGCACCCCC 


AGGGGCTTAT 


CCTGGACAGG 


CACCTCCAGG 


201 


CGCCTACCAT 


GGAGCACCTG 


GAGCTTATCC 


CGGAGCACCT 


GCACCTGGAG 


251 


TCTACCCAGG 


GCCACCCAGC 


GGCCCTGGGG 


CCTACCCATC 


TTCTGGACAG 


301 


CCAAGTGCCC 


CCGGAGCCTA 


CCCTGCCACT 


GGCCCCTATG 


GCGCCCCTGC 


351 




Alibi yu. 1 1 




TTTGCCTGGG 


GGAGTGGTGC 


401 


CTCGCATGCT 


GATAACAATT 


CTGGGCACGG 


TGAAGCCCAA 


TGCAAACAGA 


451 


ATTGCTTTAG 


ATTTCCAAAG 


AGGGAATGAT 


GTTGCCTTCC 


ACTTTAACCC 


501 


ACGCTTCAAT 


GAGAACAACA 


GGAGAGTCAT 


TGTTTGCAAT ACAAAGCTGG 


551 


ATAATAACTG 


GGGAAGGGAA 


GAAAGACAGT 


CGGTTTTCCC 


ATTTGAAAGT 


601 


GGGAAACCAT 


TCAAAATACA 


AGTACTGGTT 


GAACCTGACC 


ACTTCAAGGT 


651 


TGCAGTGAAT 


GATGCTCACT 


TGTTGCAGTA 


CAATCATCGG 


GTTAAAAAAC 


701 


TCAATGAAAT 


CAGCAAACTG 


GGAATTTCTG 


GTGACATAGA 


CCTCACCAGT 


751 


GCTTCATATA 


CCATGATATA 


ATCTGAAAGG 


GGCAGATTAA 


AAAAAAAAAA 


801 


AAAGAATCTA 


AACCTTACAT 


GTGTAAAGGT 


TTCATGTTCA 


CTGTGAGTGA 


851 


AAATTTTTAC 


ATTCATCAAT 


ATCCCTCTTG 


TAAGTCATCT 


ACTTAATAAA 


901 


TATTACAGTG 


AAAG (SEQ. 


ID. NO. 47 


) 





WO 00/62076 



PCTAJSOO/08561 



SEQUENCE LISTING 



<110> HSU, Daniel* K. 
LIU, Fu-Tong 
DOWLING, Christopher, A. 

<120> GALECTIN EXPRESSION IS INDUCED IN 

CIRRHOTIC LIVER AND HEPATOCELLULAR CARCINOMA 



<130> DANHSU.001VPC 

<140> 60/129,111 
<141> 1999-04-13 

<160> 47 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 50 
<212> PRT 
<213> chicken 



<400> 1 T 

Met Gin Ala Met Lys Ala Arg Cys Trp Gin Pro His Trp Met Leu Pro 

15 10 15 

Leu Leu Pro Leu Ser Ser Pro Leu His Pro Gin Leu Ser Asp Ala Leu 

20 25 30 

Pro Ala His Asn Pro Gly Ala Pro Pro Pro Gin Gly Trp Asn Arg Pro 
35 40 45 

Pro Gly 
50 



<210> 2 
<211> 50 
<212> PRT 
<213> chicken 



<400> 2 

Pro Gly Ala Phe Pro Ala Tyr Pro Gly Tyr Pro Gly Ala Tyr Pro Gly 

x 5 10 15 

Ala Pro Gly Pro Tyr Pro Gly Ala Pro Gly Pro His His Gly Pro Pro 

20 25 30 

Gly Pro Tyr Pro Gly Gly Pro Pro Gly Pro Tyr Pro Gly Gly Pro Pro 
35 40 45 

Gly Pro 
50 



<210> 3 
<211> 27 
<212> PRT 
<213> nematode 

<400> 3 

Met Ser Ala Glu Glu Pro Lys Ser 

1 5 
Leu Gin Glu Lys Phe Glu Pro Gly 
20 



Tyr Pro Val Pro Tyr Arg Ser Val 

10 15 
Gin Thr Leu 

25 
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<210> 4 
<211> 17 
<212> PRT 
<213> eel 

<400> 4 

Ser Gly Gly Leu Gin Val Lys Asn Phe Asp Phe Thr Val Gly Lys Phe 

1 5 10 15 

Leu 



<210> 5 
<211> 43 
<212> PRT 
<213> chicken 

<400> 5 

Tyr Pro Gly Gly Pro Pro Gly Pro Tyr Pro Gly Gly Pro Thr Ala Pro 

! 5 10 15 

Tyr Ser Glu Ala Pro Ala Ala Pro Leu Lys Val Pro Tyr Asp Leu Pro 

20 25 30 

Leu Pro Ala Gly Leu Met Pro Arg Leu Leu lie 
35 40 



<210> 6 
<211> 33 
<212> PRT 
<213> rat 

<400> 6 

Met Ala Tyr Val Pro Ala Pro Gly Tyr Gin Pro Thr Tyr Asn Pro Thr 

15 10 15 

Leu Pro Tyr Lys Arg Pro He Pro Gly Gly Leu Ser Val Gly Met Ser 
20 25 30 

He 



<210> 7 
<211> 12 
<212> PRT 
<213> mouse 

<400> 7 

Pro He Pro Gly Gly Leu Ser Val Gly Met Ser Val 
1 5 10 



<210> 8 
<211> 18 
<212> PRT 
<213> human 

<400> 8 

Met Ala Cys Gly Leu Val Ala Ser Asn Leu Asn Leu Lys Pro Gly Glu 

15 10 15 

Cys Leu 
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<210> 9 
<211> 33 
<212> PRT 
<213> human 

<400> 9 

Met Ala Tyr Val Pro Ala Pro Gly Tyr Gin Pro Thr Tyr Asn Pro Thr 

x 5 10 15 

Leu Pro Tyr Tyr Gin Pro lie Pro Gly Gly Leu Asn Val Gly Met Ser 
20 25 30 

Val 



<210> 10 
<211> 42 
<212> PRT 
<213> nematode 

<400> 10 

lie Val Lys Gly Ser Thr He Asp Glu Ser Gin Arg Phe Thr He Asn 

I 5 10 15 

Leu His Ser Lys Thr Ala Asp Phe Ser Gly Asn Asp Val Pro Leu His 

20 25 30 

Val Ser Val Arg Phe Asp Glu Gly Lys He 
35 40 



<210> 11 
<211> 41 
<212> PRT 
<213> eel 

<400> 11 

Thr Val Gly Gly Phe He Asn Asn Ser Pro Gin Arg Phe Ser Val Asn 

1 5 10 15 

Val Gly Glu Ser Met Asn Ser Leu Ser Leu His Leu Asp His Arg Phe 

20 25 30 

Asn Tyr Gly Ala Asp Gin Asn Thr He 
35 40 



<210> 12 
<211> 39 
<212> PRT 
<213> chicken 



<400> 12 

Thr He Thr Gly Thr Val Asn Ser 

1 5 
Phe Lys Arg Gly Gin Asp He Ala 
20 

Glu Asp His Lys Arg Val He 
35 



Asn Pro Asn Arg Phe Ser Leu Asp 

10 15 
Phe His Phe Asn Pro Arg Phe Lys 
25 30 



<210> 13 
<211> 41 
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<2X2> PRT 
<213> rat 

<400> 13 

Tvr lie Gin Gly He Ala Lys Asp Asn Met Arg Arg Phe His Val Asn 

1 5 10 . 15 

Phe Ala Val Gly Gin Asp Glu Gly Ala Asp He Ala Phe His Phe Asn 

20 25 30 

Pro Arg Phe Asp Gly Trp Asp Lys Val 
35 40 



<210> 14 
<211> 41 
<212> PRT 
<213> mouse 

<400> 14 

Tyr He Gin Gly Met Ala Lys Glu Asn Met Arg Arg Phe His Val Asn 

1 5 10 15 

Phe Ala Val Gly Gin Asp Asp Gly Ala Asp Val Ala Phe His Phe Asn 

20 25 30 

Pro Arg Phe Asp Gly Trp Asp Lys Val 
35 40 



<210> 15 
<211> 41 
<212> PRT 
<213> human 

<400> 15 

Arg Val Arg Gly Glu Val Ala Pro Asp Ala Lys Ser Phe Val Leu Asn 

I 5 10 15 

Leu Gly Lys Asp Ser Asn Asn Leu Cys Leu His Phe Asn Pro Arg Phe 

20 25 30 

Asn Ala His Gly Asp Ala Asn Thr He 
35 40 



<210> 16 
<211> 41 
<212> PRT 
<213> human 

<400> 16 

Tyr He Gin Gly Val Ala Ser Glu His Met Lys Arg Phe Phe Val Asn 

1 5 10 ■" 15 

Phe Val Val Gly Gin Asp Pro Gly Ser Asp Val Aia Phe His Phe Asn 

20 25 30 

Pro Arg Phe Asp Gly Trp Asp Lys Val 
35 40 



<210> 17 
<211> 44 
<212> PRT 
<213> nematode 

<400> 17 

Val Leu Asn Ser Phe Ser Asn Gly Glu Trp Gly Lys Glu Glu Arg Lys 
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I 5 10 15 

Ser Asn Pro He Lys Lys Gly Asp Ser Phe Asp He Arg He Arg Ala 

20 25 30 

His Asp Asp Arg Phe Gin lie He Val Asp His Lys 
35 " 40 



<210> 18 
<211> 48 
<212> PRT 
<213> eel 

<400> 18 

Val Met Asn Ser Thr Leu Lys Gly Asp Asn Gly Trp Glu Thr Glu Gin 

1 5 10 15 

Arq Ser Thr Asn Phe Thr Leu Ser Ala Gly Gin Tyr Phe Glu He Thr 

20 25 30 

Leu Ser Tyr Asp He Asn Lys Phe Tyr He Asp He Leu Asp Gly Pro 
35 * 40 45 



<210> 19 
<211> 46 
<212> PRT 
<213> chicken 

<400> 19 

Val Cys Asn Ser Met Phe Gin Asn Asn Trp Gly Lys Glu Glu Arg Thr 

1 5 10 15 

Ala Pro Arg Phe Pro Phe Glu Pro Gly Thr Pro Phe Lys Leu Gin Val 

20 25 30 

Leu Gys Glu Gly Asp His Phe Lys Val Ala Val Asn Asp Ala 
35 40 45 



<210> 20 
<211> 45 
<212> PRT 
<213> rat 

<400> 20 

Val Phe Asn Thr Met Gin Ser Gly Gin Trp Gly Lys Glu Glu Lys Lys 

! 5 10 15 

Lys Ser Met Pro Phe Gin Lys Gly His His Phe Glu Leu Val Phe Met 

20 25 30 

Val Met Ser Glu His Tyr Lys Val Val Val Asn Gly Thr 
35 40 . 45 



21 
45 
PRT 
mouse 

<400> 21 

Val Phe Lys Thr Met Gin Ser Gly Gin Trp Gly Lys Glu Glu Lys Lys 

15 10 15 

Lys Ser Met Pro Phe Gin Lys Gly Lys His Phe Glu Leu Val Phe Met 

20 25 30 

Val Met Pro Glu His Tyr Lys Val Val Val Asn Gly Asn 
35 40 45 
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<210> 22 
<211> 46 
<212> PRT 
<213> human 



<400> 22 

Val Cys Asn Ser Lys Asp Gly Gly Ala Trp Gly Thr Glu Gin Arg Glu 

1 5 10 15 

Ala Val Phe Pro Phe Gin Pro Gly Ser Val Ala Glu Val Cys lie Thr 

20 25 30 

Phe Asp Gin Ala Asn Leu Thr Val Lys Leu Pro Asp Gly Tyr 
35 40 45 



<210> 23 
<211> 45 
<212> PRT 
<213> human 

<400> 23 

Val Phe Asn Thr Leu Gin Gly Gly Lys Trp Gly Ser Glu Glu Arg Lys 

I 5 10 15 

Arg Ser Met Pro Phe Lys Lys Gly Ala Ala Phe Glu Leu Val Phe He 

20 25 30 

Val Met Ala Glu His Tyr Lys Val Val Val Asn Gly Asn 
35 40 45 



<210> 24 
<211> 34 
<212> PRT 
<213> nematode 

<400> 24 

Glu Phe Lys Asp Tyr Glu His Arg 

1 5 
Leu Ser He Asp Gly Asp Leu Tyr 
20 

Lys Tyr 



Leu Pro Leu Ser Ser He Ser His 

10 15 

Leu Asn His Val His Trp Gly Gly 

25 30 



<210> 25 
<211> 29 
<212> PRT 
<213> eel 

<400> 25 

Asn Leu Glu Phe Pro Asn Arg Tyr Ser Lys Glu Phe Leu Pro Phe Leu 

15 10 15 

Ser Leu Ala Gly Asd Ala Arg Leu Thr Leu Val Lys Glu 
20 25 



<210> 26 
<211> 34 
<212> PRT 
<213> chicken 
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<400> 26 

His Leu Leu Gin Phe Asn Phe Arg 

1 5 
Lys Leu Cys He Ala Gly Asp He 
20 

Met He 



Glu Lys Lys Leu Asn Gly He Thr 

10 15 

Thr Leu Thr Ser Val Leu Thr Ser 

25 30 



<210> 27 
<211> 47 
<212> PRT 
<213> rat 



<400> 27 

Pro Phe Tyr Glu Tyr Gly His Arg Leu Pro Leu Gin Met Val Thr His 

! 5 10 15 

Leu Gin Val Asp Gly Asp Leu Glu Leu Gin Ser He Asn Phe Leu Gly 

20 25 30 

Gly Gin Pro Ala Ala Ser Gin Tyr Pro Gly Thr Met Thr He Pro 
35 40 45 



<210> 28 
<211> 47 
<212> PRT 
<213> mouse 



<400> 28 

Ser Phe Tyr Glu Tyr Gly His Arg Leu Pro Val Gin Met Val Thr His 

15 10 15 

Leu Gin Val Asp Gly Asp Leu Glu Leu Gin Ser He Asn Phe Leu Gly 

20 25 30 

Gly Gin Pro Ala Ala Ala Pro Tyr Ala Gly Ala Met Thr He Pro 
35 40 45 



<210> 29 
<211> 30 
<212> PRT 
<213> human 

<400> 29 

Glu Phe Lys Phe Pro Asn Arg Leu Asn Leu Glu Ala He Asn Tyr Met 

1 5 10 15 

Ala Ala Asp Gly Asp Phe Lys He Lys Cys Val Ala Phe Asp 
20 25 30 



<210> 30 
<211> 45 
<212> PRT 
<213> human 



<400> 30 

Pro Phe Tyr Glu Tyr Gly His Arg Leu Pro Leu Gin Met Val Thr His 

I 5 10 15 

Leu Gin Val Asp Gly Asp Leu Gin Leu Gin Ser He Asn Phe He Gly 

20 25 30 

Gly Gin Pro Leu Arg Pro Gin Gly Pro Pro Met Met Pro 
35 40 45 
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<210> 31 
<211> 23 
<212> PRT 
<213> nematode 

<400> 31 

Tyr Pro Val Pro Tyr Glu Ser Gly Leu Ala Asn Gly Leu Pro Val Gly 

1 5 10 15 

Lys Ser Leu Leu Val Phe Gly 
20 



<210> 32 
<211> 47 
<212> PRT 
<213> rat 

<400> 32 

Ala Tyr Pro Ser Ala Gly Tyr Asn Pro Gin Met Asn Ser Leu Pro Val 

15 10 15 

Met Ala Gly Pro Pro lie Phe Asn Pro Pro Val Pro Tyr Val Gly Thr 

20 25 30 

Leu Gin Gly Gly Leu Thr Ala Arg Arg Thr He He He Lys Gly 
35 40 45 



<210> 33 
<211> 50 
<212> PRT 
<213> mouse 

<400> 33 

Ala Tyr Pro Ala Gly Ser Pro Gly Tyr Asn Pro Pro Gin Met Asn Thr 

1 5 10 15 

Leu Pro Val Met Thr Gly Pro Pro Val Phe Asn Pro Arg Val Pro Tyr 

20 25 30 

Val Gly Ala Leu Gin Gly Gly Leu Thr Leu Pro Arg Thr He He He 
35 40 45 

Lys Gly 
50 



<210> 34 
<211> 47 
<212> PRT 
<213> human 

<400> 34 

Pro Tyr Pro Gly Pro Gly His Cys His Gin Gin Leu Asn Ser Leu Pro 

1 5 10 15 

Thr Met Glu Gly Pro Pro Thr Phe Asn Pro Val Pro Tyr Phe Gly Arg 

20 25 30 

Leu Gin Gly Gly Leu Thr Ala Arg Arg Thr He He He Lys Gly 
35 40 45 



<210> 35 
<211> 49 
<212> PRT 
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<213> nematode 
<400> 35 

Thr Val Glu Lys Lys Ala Lys Arg 

1 5 
Asn Gly Asp lie Ser Phe His Phe 
20 

Val lie Arg Asn Ser Leu Ala Ala 
35 40 

Glu 



Phe His Val Asn Leu Leu Arg Lys 

10 15 
Asn Pro Arg Phe Asp Glu Lys His 
25 30 
Asn Glu Trp Gly Asn Glu Glu Arg 
45 



<210> 36 
<211> 49 
<212> PRT 
<213> rat 

<400> 36 , , 

Tyr Val Leu Pro Thr Ala Lys Asn Leu lie lie Asn Phe Lys Val Gly 

15 10 15 

Ser Thr Gly Asp He Ala Phe His Met Asn Pro Arg He Gly Asp Cys 

20 25 30 

Val Val Arg Asn Ser Tyr Met Asn Gly Ser Trp Gly Ser Glu Glu Arg 
35 40 45 

Lys 



<210> 37 
<2U> 49 
<212> PRT 
<213> mouse 

<400> 37 

Tyr Val Leu Pro Thr Ala Arg Asn Phe Val He Asn Phe Lys Val Gly 

15 10 15 

Ser Ser Gly Asp He Ala Leu His Leu Asn Pro Arg He Gly Asp Ser 

20 25 30 

Val Val Arg Asn Ser Phe Met Asn Gly Ser Trp Gly Ala Glu Glu Arg 
35 40 45 

Lys 



<210> 38 
<211> 49 
<212> PRT 
<213> human 

<400> 38 

Tyr Val Pro Pro Thr Gly Lys Ser Phe Ala He Asn Phe Lys Val Gly 

1 5 10 15 

Ser Ser Gly Asp He Ala Leu His He Asn Pro His Gly Asn Gly Thr 

20 25 30 

Val Val Arg Asn Ser Leu Leu Asn Gly Ser Trp Gly Ser Glu Glu Lys 
35 40 45 

Lys 
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<210> 39 
<211> 48 
<212> PRT 
<213> nematode 



<400> 39 

lie Pro Tyr Asn Pro Phe Gly Ala 

1 5 
Arg Cys Gly Thr Asp Arg Phe Lys 
20 

Phe Asp Phe Ser His Arg Phe Gin 
35 40 



Gly Gin Phe Phe Asp Leu Ser lie 

10 15 
Val Phe Ala Asn Gly Gin His Leu 
25 30 
Ala Phe Gin Arg Val Asp Met Leu 
45 



<210> 40 
<211> 48 
<212> PRT 
<213> rat 



<400> 40 

Val Ala Tyr Asn Pro Phe Gly Pro 

1 5 
Arg Cys Gly Met Asp Arg Phe Lys 
20 

Phe Asp Phe Ser His Arg Phe Gin 
35 40 



Gly Gin Phe Phe Asp Leu Ser lie 

10 15 
Val Phe Ala Asn Gly Gin His Leu 
25 30 
Ala Phe Gin Met Val Asp Thr Leu 
45 



<210> 41 
<211> 48 
<212> PRT 
<213> mouse 



<400> 41 

Thr Thr His Asn Pro Phe Gly Pro Gly Gin Phe Phe Asp Leu Ser lie 

15 10 15 

Arg Cys Gly Leu Asp Arg Phe Lys Val Tyr Ala Asn Gly Gin His Leu 

20 25 30 

Phe Asp Phe Ala His Pro Ser Arg Ala Phe Gin Arg Val Asp Thr Leu 
35 40 45 



<210> 42 
<211> 48 
<212> PRT 
<213> human 



<400> 42 

Thr Thr His Asn Pro Phe Gly Pro Gly Gin Phe Phe Asp Leu Ser lie 

15 10 15 

Arg Cys Gly Leu Asp Arg Phe Lys Val Tyr Ala Asn Gly Gin His Leu 

20 25 30 

Phe Asp Phe Ala His Pro Ser Arg Ala Phe Gin Arg Val Asp Thr Leu 
35 40 45 



<210> 43 
<211> 14 
<212> PRT 
<213> nematode 
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<400> 43 

Gin He Ser Gly Asp He GJu Leu Ser Gly He Gin He Gin 



1 



<210> 44 
<211> 13 
<212> PRT 
<213> rat 



5 10 



<4O0> 44 

Glu He Lys Gly Asp He Thr Leu Ser Tyr Val Gin He 
1 5 10 



<210> 45 
<211> 8 
<212> PRT 
<213> mouse 

<400> 45 

Glu He Asn Gly Asp He Thr Leu 
1 5 



<210> 46 
<211> 13 
<212> PRT 
<213> human 

<4O0> 46 

Glu lie Gin Gly Asp Val Thr Leu Ser Tyr Val Gin He 
1 5 10 



<210> 47 
<211> 914 
<212> PRT 
<213> human 

<400> 47 

Cys Cys Ala Gly Cys Cys Ala Ala Cys Gly Ala Gly Cys Gly Gly Ala 

1 5 10 I s 

Ala Ala Ala Thr Gly Gly Cys Ala Gly Ala Cys Ala Ala Thr Thr Thr 

20 25 30 

Thr Thr Cys Gly Cys Thr Cys Cys Ala Thr Gly Ala Thr Gly Cys Gly 

35 40 45 

Thr Thr Ala Thr Cys Thr Gly Gly Gly Thr Cys Thr Gly Gly Ala Ala 

50 55 60 

Ala Cys Cys Cys Ala Ala Ala Cys Cys Cys Thr Cys Ala Ala Gly Gly 
65 70 75 80 

Ala Thr Gly Gly Cys Cys Thr Gly Gly Cys Gly Cys Ala Thr Gly Gly 

85 90 95 

Gly Gly Gly Ala Ala Cys Cys Ala Gly Cys Cys Thr Gly Cys Thr Gly 

100 105 HO 

Gly Gly Gly Cys Ala Gly Gly Gly Gly Gly Cys Thr Ala Cys Cys Cys 

H5 120 125 

Ala Gly Gly Gly Gly Cys Thr Thr Cys Cys Thr Ala Thr Cys Cys Thr 

130 135 140 

Gly Gly Gly Gly Cys Cys Thr Ala Cys Cys Cys Cys Gly Gly Gly Cys 
145 150 155 160 
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